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Preface 


Figure 0.0.1. The|impossible cube. In mathematics and in life, although it 


sounds contradictory, the impossible is possible. Credit: Wikimedia Commons, 
authors: Original—Maksim, vector—Boivie. Licensed under Creative Com- 
mons Attribution Share Alike 3.0 Unported (https://creativecommons.org/ 


licenses/by-sa/3.0/deed.en) license. 


Philosophy about learning and teaching 
Lighten up about mathematics! 


Have fun. What activities do people get good at without the process of 
improving being unpleasant? At the top of the list of such activities are perhaps 
games| Why is that? Because they are fun! This leads us to the conclusion 


(inductive reasoning) that one of the main difficulties in exposition and teaching is 


to make the subject matter and presentation fun for the reader and student. 


Walk the path. How do we teach infants how to speak? Well, we don’t say: 
“This is the first rule of grammar: ....” Likewise, it doesn’t make sense to start 
teaching mathematical proofs by saying: “This the first rule of proving theorems: 


” 


1Disclaimer: Most of the author’s childhood was spent playing sports, pinball, video games, and 
chess. 


— 
1X 
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There’s a difference between knowing the path and walking the path. 
— Morpheus 


If we have to choose, we would rather have you walk the path of proving 
theorems than know the path of proving theorems. But, fortunately, you do not 
have to choose and you can do both. 


Make it natural and balanced. The best way to learn something is to make 
the process natural. To balance conscious thinking with subconscious thinking. Not 
to overthink things. To focus, but not to rush. To wander, but to come back. On 
one hand, ideas arise naturally, whose origins wherewith we do not know. On the 
other hand, ideas come from ideas. After all, imitation is the sincerest form of 
flattery! 


Don’t be formulaic. Cookbooks are comfortable to read. And by following 
them, we can make delicious meals! However, if we make this a cookbook on how 
to prove theorems, then aren’t we doing the same thing as making a cookbook on 
how to integrate functions, except perhaps on a higher level? 


Practical advice. These are our ideas, but we suggest that you take them 
with a grain of salt and follow your own path! 


(1) You don’t have to understand everything at once. Being confused and strug- 
gling are natural parts of learning. 


(2) We learn by watching and we learn by doing. With respect to watching, this 
book may give more detailed proofs of elementary results than other books 
on mathematical reasoning. We compensate this by including lots of solved 
problems as well as exercises with hints given at the end of each chapter. 


(3) Success is mostly based on passion, patience, and perseverance. In this sense, 
we support the|Mamba Mentality|or|John Wick’ level of dedication, tempered 
with a relaxed attitude. If it is not for this book, then whatever you are 
interested in! 


(4) No matter what our level is, we have the ability to greatly improve in learning 
a subject. Useful are: 
(a) Interest and curiosity in the subject. Ask questions while you are learning 
and reading! 
(b) Repetition. Imagine you are getting good at a video game. What do you 
do? Keep playing! 
(c) Watch and imitate (see the corresponding [Yogi-isms). In the video game 
analogy, watch good players play and copy their techniques. 


Content of this book 


Je suis désolé. ... I’ve heard it all before. 
— From ‘Sorry/ by Madonna 


A word to the wise: If you read this book, you will have to endure bad math 
puns and jokes and out-of-date pop culture references (we old-timers prefer to think 
of these as the classics). 


Now that we got that out of the way, foremost, the material in this book is 
not original. Anyone potentially teaching from this book will have heard it all 
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before. What distinguishes this book is more about what material we’ve selected, 
how we’ve decided to pedagogically present the material, and the informal style of 
writing while trying not to compromise the mathematical content of the book. 


In this book we learn how to read and write mathematical proofs. One of the 
most beautiful subjects in mathematics is number theory. The subject of number 
theory ranges from the most elementary mathematics to the deepest mathematics. 
Number theory also provides an arena in which students already have a strong 
intuition for the objects of study, so this choice makes it possible to build on that 
existing knowledge. These properties make it an ideal subject for us to study. One 
reason why mathematics can be deep and advanced is that it can be abstract and 
complicated. We will see how abstraction can be introduced to simplify proofs 
and how, in order to understand concrete problems, we are forced to understand 
abstract notions. Complicated proofs can be broken down into smaller and far 
less complicated proofs. Deconstructing complicated proofs can be helpful in their 
understanding. 


Some of the specific topics we cover are logic and implications, set theory, the 
arithmetic of integers, prime numbers, and algebraic structures. 


Some of the themes in understanding mathematics we emphasize are concep- 
tualization and visualization. 


This book is suitable to be used as a quarter or semester college course on math- 
ematical reasoning provided that one skips sections (and chapters!). The teacher 
should decide what sections to skip according to personal taste. We have put an 
asterisk * after certain sections which we feel may be of less priority. 


Style of this book 


To keep the discussion in the book lively, we do not always proceed in a linear 
manner. Indeed, you are likely familiar with many of the elementary rules and 
assumptions we use in mathematical reasoning. If we start from the beginning, 
you will likely be bored, at least temporarily. On the other hand, when we discuss 
concepts and methods that have not been introduced before, there is the risk you 
may not (at least initially) know what we are talking about. To remedy this, we 
will, when necessary or helpful, give forward references (i.e., references to material 
later in the book) or Wikipedia references (on the ‘World Wide Web), which are 
also |hyperlinked (such as the aforementioned World Wide Web and the word “hy- 
perlinked”!) to missing items. This will facilitate looking up information in the 
e-version of this book. 

We also include some awful jokes, bad puns, and esoteric popular culture refer- 
ences. Their intent is to make the book more lively, interesting, and broad. Mainly, 
we would like to encourage you to think about mathematics in your own way, 
especially writing your own proofs! 


Problem solving 


One of the best ways to learn mathematics (or just about anything for that matter!) 
is to do problems. Problems test our understanding. Problems force us to learn 
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the material better. Problems challenge us. Problems are fun (or not, depending 
on your viewpoint!). 
Polya |Pol14) has the following suggestions for “How to solve it”: 


(1) “You have to understand the problem.” 
(2) “Find the connection between the data and the unknown.” 
(3) “Carry out your plan.” 
(4) “Examine the solution obtained.” 
LaTeX 


This book is written in which is a mathematics word processing software. 
You may consider learning LaTeX to be able to type up solutions to problems! Here 
is a sample of how it works. Consider the following statements: 


I love the equation e™’ + 1 = 0. Beyond amazing is 
b 
(0.1) J f@yae = £0) - Fa). 


Here is the LaTeX code we used to write this: 


I love the equation $e7{\pi i} + 1 = 0$. Beyond amazing is 
\begin{equation} 

\int_a*b f£’(x) dx = f(b) - f(a) 
\end{ equation} 


For the reader who is interested in learning LaTeX, an internet search will yield 


many sources. One source is https://www.overleaf.com/learn 
Origins 


This book started as notes for a mathematical proofs class we taught for many 
years at the University of California San Diego using the book by Eccles [Ecc97]. 
The first few years, we used the book by Fletcher and Patty [FP96]. As such, this 
book is largely influenced by Eccles’s book. 


Further reading 


There are many directions the student may pursue after, or even before(!), reading 
this book. Below are a handful of classics, out of the many wonderful mathematics 
books in the literature: 


e Munkres, James R., Topology. Second edition. Prentice Hall, Inc., Upper 
Saddle River, NJ, 2000. xvi + 537 pp. 


e Massey, William S., Algebraic topology: An introduction. Harcourt, Brace & 
World, Inc., New York, 1967. xix + 261 pp. 


e Rudin, Walter, Principles of mathematical analysis. Third edition. Interna- 
tional Series in Pure and Applied Mathematics. McGraw-Hill Book Co., New 
York-Auckland-Diisseldorf, 1976. x + 342 pp. 
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e Spivak, Michael, Calculus on manifolds. A modern approach to classical the- 
orems of advanced calculus. W. A. Benjamin, Inc., New York-Amsterdam, 
1965. xii + 144 pp. 


e Ahlfors, Lars V., Complex analysis. An introduction to the theory of analytic 
functions of one complex variable. Third edition. International Series in Pure 
and Applied Mathematics. McGraw-Hill Book Co., New York, 1978. xi + 331 
Pp. 

e Serre, Jean-Pierre, A course in arithmetic. Translated from the French. Grad- 
uate Texts in Mathematics, No. 7. Springer-Verlag, New York-Heidelberg, 
1973. viii + 115 pp. 
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Notations and Symbols 


end of proof symbol 


equals 
defined to be equal to 


S a wishful thinking equality; usually wrong 
© smiley face, dad joke 
= congruent to 
~ related to (for a relation) 
< less than 
< less than or equal to; 
if H and G are groups, H < G denotes H is a subgroup of G 
> greater than 
> greater than or equal to 
€ an element of 
¢ not an element of 
Cc a subset of 
eB) subset, in the reverse direction 
a) intersection 
U union 
ier Xi intersection of a family of sets 
User Xi union of a family of sets 
+ addition 
multiplication 


++, x,0,@ group multiplication 


Dy sum 
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NS 
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alm 
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ComDiv(a) 
Div(a) 


e€ 


product 

the logical connective “and” 
the logical connective 
the logical connective “not” 

the logical connective “implies” 

the logical connective “if and only if” (biconditional) 
implies, in the reverse direction 

composition of functions 

exists 

for all 

does not exist 

the empty set 


binomial coefficient; read as “n choose k” 


factorial if after a number; if after a word, wow! © 

the (non-negative) square root of a non-negative real number x 
the characteristic function of a subset A 

the cardinality of the set of integers 

the cardinality of the set of real numbers 

Euler’s totient function 

a divides b 

the closed interval from a to b 

the open interval from a to b 

the set of real numbers x > a 

the set of real numbers x < 6 

a sequence 

the congruence class of a modulo m 

if a is a group element, the inverse of a 

if a is a group element, the subgroup generated by a (see (8-110)) 
the set difference A minus B 

argument of a complex number 

principal value of the argument 

the open ball of radius r centered at x 

the closed ball of radius r centered at x 

the boundary of the ball, a.k.a. sphere of radius r centered at x 
the complement of a subset 

the set of complex numbers 

set of common positive divisors of a and b 

set of positive divisors of a 


if the base of an exponential, Euler’s number; approximately 
2.71828; e is also used to denote the identity element of a group 


Notations and Symbols Xvil 


the set of even integers 

a function from X to Y 

(usually) the n-th Fibonacci number 

the set of functions from X to Y, a.k.a. Y* 

the restriction of a function to a subset A of the domain 
the pre-image of a subset B of the codomain 
f*({y}) 

the graph of a function f 

greatest common divisor 

the set of quaternions 

if referring to a complex number, the imaginary unit 
the identity function of a set X 

the image of a function f 

infimum 

set of injections from a set X to a set Y 

least common multiple 

limit 

natural logarithm 

the maximum 

the minimum 

modulo 

the p-th Mersenne prime 

the set of integers that are multiples of m 

the set {1,2,3,...,n} 

the set of odd integers 

typically a prime number 

the power set of X 

the set of k-element subsets of X 

the set of rational numbers 

the set of positive rational numbers 

Legendre symbol 

{0,1,2,...,m—1}, the set of remainders modulo m 
set of elements of R,, coprime to m 

the set of real numbers 

the set of positive real numbers 

the set of non-negative real numbers 

n-dimensional (real) Euclidean space 

the remainder function r : Z + Rm, also denoted by ry, 
the unit n-dimensional sphere 

the n-dimensional sphere of radius r 


XViil Notations and Symbols 


sup supremum 

XxxY the cartesian product of X and Y 

[x] the equivalence class of x € X, where X has an equivalence 
relation ~ 

x” the n-fold cartesian product of X 

x-y the Euclidean dot product 

|x| the Euclidean norm 

|X| if X is a finite set, its cardinality 

(x, y) an ordered pair 

(%1,%2,...,%,) an (ordered) n-tuple 

X] Nie, Xi if I is a set of indices of a family {x;} 

cr |x| the floor function 

cr [ax] the ceiling function 

Z, the set of integers 

Zt the set of positive integers 

Ze the set of non-negative integers 

Zim {[DJin,s (Lm: [2]m,---,[™— 1m}, the set of congruence classes 
modulo m 

zx, set of elements of Z,, with inverses (see (8.28)) 


Zz the complex conjugate of z (see (8.17i)) 


Down the rabbit-hole A 


‘From “Alice’s Adventures in Wonderland” by Lewis Carroll. Wikimedia Commons, Public 
Domain. 


Chapter 1 


Evens, Odds, and Primes: 
A Taste of Number Theory 


Figure 1.0.1. A double rainbow at Yellowstone National Park. Photo by 
Jingwei Xia. 


Trinity: It’s so beautiful. — From “The Matrix Resurrections” 


Goals of this chapter: To demonstrate that we can, without 
much background, understand some elementary aspects of a deep 
subject: prime numbers. To see some of the pitfalls of inductive] 
To give examples of deductive reasoning leading to the 
proofs of results. After reading this chapter, the reader should be 
able to write some simple proofs such as in the exercises. 


_| 
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Figure 1.0.2. (fl. 300 BC). Detail from Raphael’s The School of 
Athens. Wikimedia Commons, Public Domain. 


In this introductory chapter, we take a closer look at something we are all 
familiar with: even and odd integers. Perhaps a bit less familiar, but still something 
we know about, are prime numbers. Do you know|Euclid’s Theorem| that there are 
infinitely many prime numbers? If you know it, congratulations; you are truly 
knowledgeable[}] If not, you will first learn the proof of this wonderful theorem in 
this book? 


On the other hand, do you know the proof that there are infinitely many twin 
primes? As of this writing, nobody knows whether this statement is true or false 
although remarkable progress has been made on this conjecture (see {1.9*] below 
for the definition of “twin prime” and for the statement of the conjecture). The 
subject of prime numbers, while fascinating, at advanced levels can be not at all 
easy. So in this chapter we consider results that are either easy to state, easy to 
prove, or both. 

Math, perhaps like life, is a game. Imagine beginning to play a video game 
with 40 levels. At the beginning, level 38 looks very, maybe impossibly, difficult. 
Math, like some video games, can be thought of as being built on levels. Our goal 
will be to get good at the game of math. 


If we approach the game of math very formally, then proceeding through the 
levels of our game will necessarily take a lot of time and a lot of pages. So, typically, 
in a book we jump over some of the elementary levels. In this book, to make the 
discussion more lively, we give you, the reader, a sneak peak of the higher levels, 
including some famous unsolved conjectures! 


Instead of beginning with a more formal treatment of the basics of logic, we will 
first learn how to prove theorems by example. Our philosophy is that the easiest 
way to learn how to prove statements is by seeing how it is done. We encourage you 
to think independently and to work out proofs with the minimum amount of help 
that you need to work it out in a reasonable amount of time. For example, while 


1This type of sentence is patterned after the pattern of speech of chess Youtuber 
?That is, if you do not first click on the Wikipedia page link for this theorem! Euclid’s Theorem 
is also Theorem [4.12]in this book. 
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you are reading a proof, you may have an aha moment, where you get the idea of 
the proof and feel that you can finish off the proof by yourself. When this happens, 
we hope that you try to do this. At least at the beginning, it may be helpful to 
check your proof with the proof in the book or any other source. Typically, the 
proofs will be rather similar. If they are not, then you have an independent proof! 

In this chapter, as in every chapter, exercises are sprinkled throughout the text. 
We strongly suggest you work as many problems as you can. After straining for at 
least a few minutes, you may look at the hints at the end of the chapter. 


= 
oD 
2 


( 


© 
,OOO 


C 
¢ 


a 


'o' 
ww 


oO 
O 
ao 


-_ 


900000 


oO 
To. 
WwW 
fr 


N 
es} 


& 
: 


00¢ 
2) 


44 


b 
o 


IOC 


wi 
i 


OC 


o) 
NS 


9 


»00000 
SOOE 
sy 


00 


0000000 


otek: 
000 CG 
900000000 


¢ 
© 
> © © 


ao 
on 


OO<¢ 
a 
> 


= 
° 
o 


100000000¢ 
000000000 


> 
O<¢ 

a¢ 
O< 


re 
¢ 
¢ 


Figure 1.0.3. Can you decode the color coding of the integers from 1 to 100? 
This image is from Dan Finkel’s TEDx talk “Five Principles of Extraordinary 
Math Teaching”. 


1.1. A first excursion into prime numbers 
Not all math puns are awful, just sum. 


In this section we learn what a prime number is, we see all of the primes less 
than 100, and we formulate some naive conjectures on primes, which we prove are 
false. 


1.1.1. Prime numbers. Let Z denote the set of integers, including the positive 
ones, the negative ones, and zero, denoted by 0. For example, —17 and 8 are 
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integers, but 7, read as “pi”, is not. On this set Z we have the operations of 
addition + and multiplication - or x. For example 2+2=2-2=4and2-3=6. 


1.1.1.1. Definition of prime number. Let us consider positive integers. Primes 
are their multiplicative building blocks. We see that some numbers factor, such as 
6 = 2-3, 12 =4-3, and 35 = 5-7. Other numbers don’t factor, such as 17: we can 
write 17 = 1-17 =17-1, but there is no other way to write 17 as a product of two 
positive integers. With this in mind, we make the following: 


Definition 1.1. We say that an integer p > 2 is prime if the only way to factor 
p as the product of two positive integers is 


pe lip=p-4. 
That is, the only way of “breaking up” p as the product of positive integers is the 


trivial way. In other words, an integer at least 2 is prime if and only if it is 
“indivisible” multiplicatively. 


Remark 1.2. By “if and only if” we mean “exactly when”. We will see a formal 
definition of “if and only if”, which we do not need here, in Chapter[3] below. When 
the word “if” is used to make a definition, we actually mean “if and only if”. 


For example, 2 is prime since the only way to factor it as the product of positive 
integers is 2 = 1-2 = 2-1. Similarly, we see that 3 is prime since the only way 
to factor it as the product of positive integers is 3 = 1-3 = 3-1. One the other 
hand, even though the only way to factor 1 as the product of two positive integers 
is 1 = 1-1, we have that 1 is not a prime simply because 1 < 2. As will be evident 
later, one makes this choice that 1 is not a prime so that many results involving 
primes will be smoother to state. We see that 4 is not prime since 4 = 2-2 and 
since the positive integers 2,2 are not 1,4, in either order. Next, 5 is prime as 
5=1-5=5-1 is the only way to factor 5 as the product of two positive integers. 
But 6 is not prime since 6 = 2-3 and since the positive integers 2,3 are not 1,6, in 
either order. 


Exercise 1.1. Explain the color coding of the first one hundred positive integers 
in Figure |L.0.3| The topic of is a hint! If you are having difficulty with this 
exercise, continue reading and come back to it. 


Certain elementary facts about integers will be useful. 


Solved Problem 1.3. Prove that if a is a positive integer satisfying a #1, then 
a> 1, and in fact a > 2. 


Solution. Firstly, for this proof, we assume the basic facts that 1 is the smallest 
positive integer and 2 is the next smallest positive integer. In particular, there are 
no integers strictly between 0 and 1, and there are no integers strictly between 1 
and 2. 


Let a be a positive integer satisfying a 4 1. Since a > 0 and a is an integer, we 
have a > 1. Indeed, there is no integer a satisfying 0 < a < 1. Now, since a > 1 
and a # 1, we obtain a > 1. Now, since there is no integer a satisfying 1 < a < 2, 
we conclude that a > 2. 
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The O symbol indicates the end of the solution or proof. That was a relatively 
easy proof. And it was indeed a proof. That is, it was 


a logical argument which demonstrated the truth of a statement. 


Part of the key to our success was that we clearly understood beforehand what it 


means to be an integer. As Polya said, ‘You have to understand the problem.) 


Exercise 1.2. Let n be a positive integer, and suppose that a and b are integers 
such that n = ab. Prove: If a is equal to 1 or n, then b is equal to n or 1, 
respectively. Hint: Show that ifa=1, thenb=n. Similarly for 1 and n switched. 


Remark 1.4. In the above, we boldfaced the “if-then” and “or” natures of the 
statement. If-then statements are called implications (a.k.a. 
ments), and or is an example of a logical connective, all of which is studied in more 
detail in Chapter B] below. But we assume that since such statements are common, 
even in elementary mathematics, you are comfortable with their meanings. 


The exercise above is a short proof, but there is logical reasoning going on here! 
Usually we give hints to the exercises at the end of the chapter. However, since this 
is our first. “proof” exercise, we solve part of Exercise [L.2] here: 

Suppose a= 1. Thenn=a-b=1-b=b. This proves: If a= 1, then b= n. 

Characterization of non-primes: So what does it mean for an integer n not 
to be prime? Firstly, if n < 1, then n is not prime. So let us assume that n > 2. 
The following is a tweak to the characterization of not being prime. 


Lemma 1.5. An integer n > 2 is not a prime if and only if there exist integers 
a>1landb>1 such that n = ab. 


Proof. By Definition n is not prime precisely when there is another way to 
factor n by positive integers besides 1-n and n-1; that is, there exist positive 
integers a and b such that 


(1.1) n=ab, 


where a,b are not equal to 1,n in any order. 


Claim. The condition above that a,b are not equal to 1,n in any order is 
equivalent to the condition 


(1.2) a#1 and bAl. 


Proof of the claim. Firstly, if a 4 1 and b ¥ 1, then clearly a,b are not equal 
to 1,n in any order. 


Secondly and conversely, suppose that a,b are not equal to 1,n in any order. 
Then suppose for a contradiction that a = 1. By Exercise[I.2] we have b = n, 
contradicting that a,b are not equal to 1,n in any order. Since we have arrived at 
a contradiction to our last assumption that a = 1, we must have that a 4 1. 


The proof that b 4 1 is exactly analogous to the proof that a 4 1 by switching 
the roles of a and b in the proof. So we have proved the claim. 


6 1. Evens, Odds, and Primes: A Taste of Number Theory 


Since a and b are positive, by Solved Problem [3] condition (£2) is in turn 
equivalent to the condition that 


(1.3) a>1 and 6>1. 


This and the proved claim above complete the proof of the lemma. 


Remark 1.6. This was the first time we proved something by the method of 
contradiction. Namely, at one point in the proof we assumed that a = 1, and 
from this assumption we logically deduced a contradiction. This proves that the 
assumption a = 1 is false, so necessarily we have a 4 1. We will discuss proof by 
contradiction in more detail in Chapter B] 


For Lemma [1.5] we walked the path of a proof. By going over the logic of the 
proof, you will know the path of the proof better. There is no single right balance 
of walking and knowing proofs. The way you choose is your path. 


Exercise 1.3. Given that n = ab, show that condition (L.3) ona and b is equivalent 
to the condition 


(1.4) l<a<n. 


This is also equivalent tol <b<n. Hint: Use some basic facts about inequalities. 


1.1.1.2. Composite numbers. Non-prime integers greater than 1 are also called 
composite numbers. So, by Lemma |[l.5} an integer n > 1 is composite if and 
only if there exist integers a,b > 1 such that n = ab. 

If n = ab, where a and 6 are integers, then we say that a and 6 are divisors 
of n. Another way to say this is: An integer a is a divisor of an integer n if there 
is some integer b such that n = ab. We also say that a divides n. We can think 
of a dividing n as meaning that a is “contained” in n from the point of view of 
multiplication. 


Example 1.7. (1) 7 is a divisor of 56 since 7-8 = 56 and 56 is a composite 
number. 
(2) 1 is a divisor of any integer n since 1-n =n. 
Exercise 1.4. Show that 0 is not a divisor of any non-zero integer. 


Lemma 1.8. Ifa and n are positive integers and if a is a divisor of n, then 
(1.5) l<a<n. 


Proof. Let a and n be positive integers such that a divides n. Since a is a positive 
integer, we have a > 1 (it is not possible that 0 < a < 1). We also have that 
there exists a positive integer b such that ab = n. Since b > 1, this implies that 
a= FS f=n. 


Notice how most mathematical proofs, in contrast to computer programming, 
have (hopefully small) jumps in logic. For example, in the proof above, we used 
n 


the “elementary” fact that since n > 0 and b > 1 > 0, we have | < {. Fora 
justification of this fact, see (18) below. 


Given a positive integer n, we call 1 and n trivial divisors of n. If l<a<n 
is a divisor of n, then we call a a non-trivial divisor of n. 
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Example 1.9. 1 and 15 are trivial divisors of 15, whereas 3 and 5 are non-trivial 
divisors of 15. 


Exercise 1.5. Let n = ab be a positive integer. Prove that if a is a non-trivial 
divisor of n, then b is a non-trivial divisor of n. 


By Exercise [1.3] we have the following further tweak of the characterization of 
composite numbers. 


Corollary 1.10. Let n> 2 be an integer. The integer n is a composite number if 
and only if there exists a divisor a of n satisfying 1<a<_n, that is, if and only if 
n has a non-trivial divisor. 


Proof. Let n > 2 be a composite number. By Lemma [L.5] this is equivalent to 
there existing integers a > 1 and b > 1 such that n = ab. 

By Exercise [1.3] this implies that a is a non-trivial divisor of n. 

Conversely, suppose that n > 2 has a non-trivial divisor a. Since a is a divisor 
of n, there exists an integer b such that n = ab. Since 1 < a < n, by Exercise [3] 
again, this implies that 1 < b <n. This proves that n is a composite number. 


So, a composite number is an integer at least 2 which has at least one non- 
trivial divisor. On the other hand, a prime is an integer at least 2 which has no 
non-trivial divisors. For example, to prove that 5 is prime (as we claimed earlier), 
we just need to check that none of 2,3, 4 divide 5. To reprove that 6 is a composite 
number, we just need to observe that 2 divides 6 and that 1<2 <6. 


A visualization of prime versus composite number is given in Figure[L.L.1 


8 9 


Figure 1.1.1. Visualizing the integers 2 through 9 as primes or composite 
numbers. The composite numbers are 4 = 2-2,6=2-3,8=2-4,9=3-3. 
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1.1.1.3. The primes under 100. Having found which integers at most 6 are 
primes, if we continue in this way of checking for non-trivial divisors, then we can 
find the primes less than 100. 


Exercise 1.6. List all of the primes less than 100. Hint: We give the answer 
momentarily, but don’t peek! 


By Corollary [[.10] an integer p > 2 is prime if and only if p does not have a 
divisor a satisfying 1 <a <p. 


It is cool to picture primes in red and non-primes in green as in Figure [1.1.2] 


O01 02 03 04 05 06 O7 O08 O09 10 
11 12 13 14 15 #16 17 #18 19 20 
21 22 23 24 25 26 27 28 29 30 
31 32 33 34 35 386 37 38 39 40 
41 42 43 44 45 46 47 48 49 50 
51 52 53 54 55 56 57 58 59 60 
61 62 63 64 65 66 67 68 69 70 
71 72 73 74 75 76 77 78 79 80 
81 82 83 84 85 86 87 88 89 90 
91 92 93 94 95 96 97 98 99 100 


Figure 1.1.2. The first 100 positive integers: primes are in red and composite 
numbers are in green. 


Exercise 1.7. We make the following observations about Figure [L.1.2] 


(1) For the second, fourth, sixth, eighth, and tenth columns, all of the integers are 
composite except for the number 2. 


(2) For the fifth and tenth columns, all of the integers are composite except for 
the number 5. 


(3) All of the multiples of 3, except for 3 itself, are composite numbers. 


Explain the reason for why each of these observations is true. 


Now we belatedly list the primes less than 100. They are, as pictured in Figure 
[L.1.2] in increasing order: 
(1.6) 
2,3,5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59, 61, 67, 71, 73, 79, 83, 89, 97. 


There are 25 in all, so assuming that you live to one hundred years old, on one 
quarter of your birthdays you will be a prime number of years old. So at age 97 
you can say that you are in the prime of your life! 

Although, we have listed the primes less than 100, it is a good exercise to give 
a proof that the first several in our list are actually primes. 


Exercise 1.8. Prove that each of the integers 5,7,11,13,17 1s a prime. That is, 
show that each of these integers does not have any non-trivial positive divisors. You 
may use Theorem [1.34] below. 
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We remark that the prime|17| is a rather interesting integer. The address of the 
Mathematical Sciences Research Institute (MSRI) is 17 Gauss Way in Berkeley. 


Remark 1.11. To see that 5 is not a prime, we calculate that 3 = 2.5, 3 = 1-4 2, 


and 3 = 1.25, none of which are integers. (Namely, 2 < 2 < 3,1 < 3 < 2, and 


2 
1l< 3 < 2. Any real number strictly between two consecutive integers is not an 
integer.) Therefore, none of 2,3,4 divide 5. This proves that 5 is prime. 


1.1.2. Conjectures on primes—to prove or not to prove. That is the ques- 
tion! 

To understand any concept, such as primes, we should ask questions and guess 
and check statements pertaining and related to this concept. A guessed statement 


is called a/conjecture} 


1.1.2.1. Naive conjectures. Let us make a “gonzo” conjecture based on wishful 
thinking. 

An elementary fact is that all primes greater than 2 are odd; we will prove this 
in Theorem [1.19] (or Corollary [[.23) below, when we discuss even and odd in more 
detail. Moreover, positive powers of 2 are even, such as 2' = 2, 2? = 4, etc. So 
2” — 1 is odd for every positive integer n. Naively, based on these observations we 
can ask: 


Is 2” — 1 always prime for every positive integer n? 


Let us try the first few cases. For n = 1, we have 2” = 2—1 = 1, which is not 
prime. So we add the hypothesis that n > 2. Now, for n = 2, we have 2” —1=8, 
which is prime. For n = 3, we have 2” — 1 = 7, which is prime again. 


So we are quick (perhaps too quick!) to pose the following. 


Conjecture 1.12. [fn > 2 is an integer, then 2” — 1 is a prime. 


False conjectures are often easy to disprove. This statement is no exception! 
For n = 4, we have 2” — 1 = 15, which is not prime since 15 = 3-5. This suffices to 
disprove the conjecture, for there exists an integer at least 2, namely 4, for which 
the conjectured statement is false. In other words, if the conjecture were true, then 
15 would be prime, but it isn’t. So the conjecture is false. 


So, by the logical argument above, which is not much of an argument(!), we 
have proved: 


Proposition 1.13. Not all integers of the form 2” — 1 are prime, where n > 2 is 
an integer. 


Moreover (we leave it to you to check this), not all integers of the form 2” + 1 
are prime, where n > 0 is an integer. 


One of the goals of the book by Fletcher and Patty is to develop the 
reader’s ability to “... distinguish mathematical thinking from wishful thinking.” 
This is very important, and we would like to add to this statement that we would 
like to encourage wishful thinking (that is, conjecture making) with the qualifica- 
tion that the reader should logically check the wishful thinking to see if they can 
determine whether their conjecture is true or false. 
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Exercise 1.9. Prove or disprove the following conjecture: For every non-negative 
integer n, 3" +2 is a prime number. 


So, you have to decide whether the conjecture above is true or false. How did 
your thinking about the truth or falsehood of the statement evolve as you thought 
about the exercise? 


In general, mathematical results and conjectures come from observation. In a 
sense, mathematics is an experimental science. We look at mathematical objects 
and structures, such as prime numbers, and we search for patterns. We then make 
hypotheses about the existence of patterns based on our observations. Before we 
prove or disprove them, these hypotheses may be true or false for all we know. To 
prove a conjecture, we need to come up with a logical argument that establishes 
the statement of the conjecture. Typically, to disprove a conjecture, we just need 
to find a counterexample, that is, an object that does not fit the pattern we 
conjectured. More precisely, a counterexample to a statement is an example (a.k.a. 
special case) for which the statement is false. 


The previous paragraph exhibits a rational way of approaching conjectures. 
A less rational approach, as we saw in Conjecture [1.12] is what we describe as 
“gonzo mathematics”: rather arbitrarily hypothesizing patterns based on only a 
small amount of observation. In other words, jumping to conclusions. Here, due to 
mostly wishful thinking with only a little empirical evidence, we hypothesize pat- 
terns usually guided by beauty and simplicity, which are subjective characteristics. 

One advantage of making gonzo conjectures is that we gain experience and 
feedback. By determining whether they are true or false, we often are inspired to 
make better and less gonzo (i.e., less naive) conjectures. 


Thing One to Thing Two: Did you know that gullible isn’t a word? 


As in the imagined conversation above (between two Dr. Seuss characters), it 
is perhaps just as important to know what is false as it is to know what is true! 


1.1.2.2. Less naive conjectures that are still false. Going back to Conjecture 
[1.12] we may observe that 2 and 3 are primes, whereas 4 is a composite number. Is 
it possible that Conjecture [L.12]is true for n prime? Let’s see. For n equal to the 
primes 2,3,5,7 we obtain for 2” — 1 the integers 


3, 7,31, 127, 
which are all primes. (We show below that 127 is a prime.) In general, the larger 
the number, the harder it is to check whether or not it is a prime. 
Despite the optimistic calculations above, can you disprove the following con- 


jecture? 
Conjecture 1.14. If p> 2 is a prime, then 2? — 1 is a prime. 
To see if this conjecture has a chance to be true, we check the primeness of 


2? — 1 for the next prime p = 11. We obtain the integer 2047, which is equal to 
23 - 89) So we conclude that Conjecture [14] is false! So, it was a bit harder to 


3Indeed, 23 x 100 = 2300. So 23 x 90 = 2300 — 230 = 2070. Thus 23 x 89 = 2070 — 23 = 2047. 
Quick maths! 
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disprove this conjecture as compared to Conjecture [1.12| but by checking a few 
more primes we were able to do it. 


Summarizing, we have proved: 
Theorem 1.15. The following statement is false: 
“If p > 2 is a prime, then 2? — 1 is a prime.” 

In other words, there exists a prime number p such that 2? — 1 is not a prime. 
In particular this is the case for p = 11. 

Here are 2? — 1, color coded, for the the primes p < 20: 

(1.7) 3, 7, 31, 127, 2047, 8191, 131071, 524287. 

The moral of the story is that sometimes one aptly chosen example is enough 
to disprove a theorem, so it often makes sense to look for such an example. Even if 
you don’t find one, often the work of checking several examples may lead you to a 
pattern that leads to a proof. So, having made a conjecture, it makes sense to check 


the validity of a number of examples of the stated conjecture. Many conjectures 
fall by the wayside simply from checking some examples! 


1.1.2.3. A true conjecture about primes. Now that we have had so many misfires 
in formulating conjectures, it is nice to state true conjectures. A wonderful and true 
result is: 


Theorem 1.16. [fn is a positive integer such that 2” —1 is a prime, then n itself 


is prime! 


For the proof of this, see Exercise [10] or Theorem [3.40] below. 


To analyze the implication in Theorem [1.16] we’ll need some elementary logic, 
which we briefly discuss here and which will be discussed in more detail in Chapter 
[3] Consider an implication of the general form: 


If P, then Q. 

This statement is equivalent to its contrapositive implication (see 43.7] below): 
If not Q, then not P. 

For example, let p be an integer greater than 2, and consider the implication: 
If p is prime, then p is odd. 

Then this implication is equivalent to its contrapositive: 
If p is not odd (that is, p is even), then p is not prime. 


(Recall that we are assuming that p > 2.) 
The special case of the logical equivalence above that we find useful is: The 
implication in Theorem [1.16] is equivalent to its “contrapositive” : 


Let n be a positive integer. If n is a composite number, then 2” — 1 
is a composite number. 
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Regarding this last statement, which we have not proved yet and leave as an exercise 
below, let us look at some examples. We have that 2° — 1 = 63 and 63 = 3? - 7. 
We observe that the non-trivial divisors of 6 are 2 and 3 and that 2? — 1 = 3 and 
23 —1=7 are divisors of 63 = 26 — 1. Next, let us consider the composite number 
10 and we compute that 219 — 1 = 1023 and that 1023 = 3-11-31. Again, we 
observe that the non-trivial divisors 2 and 5 of 10 have the property that 27-1 = 3 
and 2° — 1 = 31 are divisors of 1023 = 21° — 1. We might as well try one more 
example to see the pattern more clearly: we compute that 2'? — 1 = 4095 and that 
4095 = 32-5-7-13. The non-trivial divisors 2, 3, 4, and 6 of 12 satisfy 


2?-1=3, 22?-1=7, 2*-1=15=3-5, 2®-1=63=37.7, 
which are all divisors of 4095 = 2!* — 1. All of this leads us to: 
Conjecture 1.17. If we have positive integers a,b,n satisfying n = ab, then 2-1 


and 2° —1 both divide 2” —1. Thus, if n is a composite number, then 2” —1 is a 
composite number. 


Exercise 1.10. Prove the conjecture. Hint: Apply polynomial long division, where 
the polynomial variable “x” is equal to 2*. Namely, divide the polynomial 2” —1 = 
x’ —1 (explain why this equality is true) by the linear polynomial 2* — 1 = 2 —1. 
Then explain why this proves the conjecture. 


Since the conjecture implies Theorem [1.16] this exercise provides a proof of the 
theorem. 


1.2. Even and odd integers 


What is more elementary than prime numbers, but nonetheless interesting, are 
the notions of even and odd. We discuss elementary properties of even and odd 
integers, including proving properties relating to addition, multiplication, and prime 
numbers. 


1.2.1. Properties of even and odd integers. 


Question: Do you know what’s odd? 
Answer: Numbers that aren’t divisible by two. 
Rebuttal: What are the odds you are even right? 


Statements involve concepts. We discuss concepts by making definitions. Even 
though we all know what even and odd integers are, we now give their formal 
definitions. 


Definition 1.18. An integer n is even if there exists an integer a such that n = 2a. 


For example, 98 is even since 98 = 2-49. That is, the integer a that exists 
satisfying 98 = 2a is a = 49. 
Two is the only even prime: 


Theorem 1.19. [fn > 2 is an even integer, then n is not prime. 


Proof. Since n is even, there exists an integer b such that 
(1.8) n = 2b. 
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Since 2b = n > 2, we have b > 1. This and 2 > 1 imply that n is not prime (by 
Lemma [L.5). 


Mini-analysis of the proof above: The proof is an example of a “direct” proof. 
Our aim was to show that n is the product of two integers a,b > 1. Using our 
hypothesis, we were able to do this with a = 2. 


Two mice, Mickey and Minnie, not quite satisfied with the mini-analysis (per- 
haps Mickey wasn’t satisfied with the earlier proofs either!), have a long and some- 


times silly Socratic dialogue] about the proof: 
How did you do that? It looked like magic. ’m not quite convinced. 


Well, I looked at the statement and saw that we have to prove that each 
even integer greater than 2 is not prime. So firstly, I took such an integer. 


Mickey: How do you take such an integer? Is there a store where I can buy them? 
Minnie: Actually, I conjured them out of thin air by using the word “let”. So we 
started with: 


Let n > 2 be an even integer. 


This is our hypothesis. 

Mickey: Seems impenetrable to me. How can you conjure something out of noth- 
ing? 

Minnie: It is like conjuring anything. For example, assume that we have a unicorn. 
We can always assume this. However, the key is to prove something about the 
unicorn. Since unicorns are mythical objects, we won’t get very far. So let’s stick 
to math. 


Mickey: I can agree to that! 

Minnie: Now, please give me an even integer. 

Mickey: Okay, how about the number 2? 

Minnie: Sorry! We want an integer that is greater than 2. 
Mickey: Okay, then, let’s take 46. 


Minnie: 46 is good. We have 46 = 2-23, so it is not prime since 2 > 1 and 23 > 1. 
Please give me another. 


Mickey: Let’s try 1363. 

Minnie: That’s not even. 

Mickey: How do you know? 

Minnie: Well, 1363 = 2- 681.5, but 681.5 is not an integer, so 1363 is not even. 


Mickey: Alright, I’m feeling bold, so now let’s take n, where n is even and greater 
than 2. 


Minnie: Great! Since n is even, by definition: 


There is some integer b such that n = 2b. 


4Silliness is not restricted to Socratic dialogues. For example, there is the Ministry of Silly Walks. 
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And not only that, since n > 2, we also get 
b=F>1. 
Mickey: That is all fine and dandy. Now what? 


Minnie: Now we need to observe that we are actually done with the proof! Why 
is that? Because we have proved that our hypothesized integer n is the product 
of two integers: a = 2 > 1 and b > 1. By the definition of prime number (more 
precisely, Lemma [I.5), we see that this implies that n is not a prime number! 


Mickey: So I give you an integer that is greater than 2 and equal to 2 times 
something. You tell me that this something is greater than 1. Since both 2 and 
this something are greater than 1, my integer, which is their product, is not prime. 


Minnie: You’ve got it, way to go! 
Mickey: You are awesome! (0 


Minnie: Thanks, so are you! 0 

As in the joke quoted at the beginning of this subsection, we have: 
Definition 1.20. An integer n is odd if n is not even. The of an integer 
refers to whether it is even or odd. 

For example: 
Lemma 1.21. The integer 3 is odd. 
Proof. Suppose for a contradiction that 3 is even. Then 3 = 2a for some integer 
a. Then a = 1.5, which is not an integer || So we have a contradiction to the sup- 


position that 3 is even. Now, a contradiction cannot follow from a true statement. 
So we conclude that 3 is not even. 


The argument above can generalized as follows. 


Lemma 1.22. Any even integer plus 1 is odd. In other words, if n is even, then 
n+1 is odd. 


Proof. Let m be an even integer. Then n = 2k for some integer k. Suppose for 
a contradiction that n+ 1 is not odd; that is, suppose n + 1 is even. Then there 
exists an integer @ such that n+ 1 = 2@. We conclude that 2k +1=n+1 = 2, 
which implies that 

1 = 2¢— 2k = 2(€—k). 
Hence, since ¢ — & is an integer, we have that 1 is even, a contradiction. (Alterna- 
tively, 2-—k = 4 is not an integer, a contradiction.) Therefore n + 1 is odd. 


Finally, we remark that we should be careful with how we state results. By 
Theorem [1.19] each of the statements in the following corollary, when interpreted 
correctly, is true! 


Corollary 1.23. All prime numbers are odd except one. Actually, all prime num- 
bers are odd except two. 


5 Alternatively, since 2 < 3 = 2a < 4, we have 1 < a < 2, so that a is not an integer. 
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a 


Indeed, in the first sentence we mean “... except one of the prime numbers”, 
while in the second sentence we mean “... except the prime number two”! So, in 
the statements of the corollary we were ambiguous. Having learned our lesson, from 
now on we will be more careful to write clear mathematical statements. 


If you are not sure about the proof of this corollary, for a hint see Exercise|3.28 
below in the chapter on implications and all that. 


1.2.2. Even and odd and addition. 


Question: How do you make seven an even number? 
Answer: Remove the “s”. 


In this subsection and the next, we discuss the properties of even and odd with 
respect to the arithmetic operations of addition and multiplication. To wit, we 
answer the question: How does parity interact with addition and multiplication? 
Here are the answers. 


Theorem 1.24. The sum of two even integers is even. In other words, if a and b 
are even integers, then the integer a+ b is even. 

Proof. Suppose that a and b are even integers. Then there exist integers k and @ 
such that a = 2k and b = 2é. We calculate that 

(1.9) at+b=2k+20=2(k+2). 


Since k+ is an integer (the sum of two integers is an integer), by (1.9) we conclude 
that a+ 6 is even. 


Regarding the proof of Theorem [1.24] if you prefer to think visually, we offer 
Figure [1.2.1] 


0 1 k-1 k k+l k+e 


Figure 1.2.1. A visual proof that an even number plus an even number is 
even: a = 2k and b = 2é implies a+ b = 2(k + £). 


Theorem 1.25. The sum of an even integer and an odd integer is odd. 


Proof 1 of Theorem This proof uses a fact which we will not prove until 
a fair bit later in the book. Thankfully, this fact is something you are most likely 
familiar with and take for granted as being true. 


Fact 1.26. Ifa is an odd integer, then there exists an integer k such that 
a=2k+1. 


That is, any odd integer can be written as an even number plus 1. 
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As an example, 17 is odd, and 17 = 2-8+1. See Corollary below for how 
this fact follows from the “Division Theorem” (Theorem[4.1] below). We now boldly 
proceed to prove the theorem. 


To boldly go where no person has gone before! — Star Trek 


Let a be an odd integer and let b be an even integer. Since a is odd, by the 
fact above, there exists an integer k such that a = 2k + 1. Since b is even, there 
exists an integer @ such that b = 2@. We calculate that 


(1.10) at+b=(2k+1)+2¢=2(k+2)41. 


Since k + @ is an integer, by (1.10) and Lemma[Z.22] we have that a+ 0 is odd. 


Proof 2, by contradiction, of Theorem [1.25] By the commutativity of addi- 
tion, we may assume that the first integer is even and the second integer is odd. So 
let a be an even integer and let b be an odd integer. Then there exists an integer 
k such that a = 2k. 


Suppose for a contradiction that a+ is not odd; that is, it is even. Then there 
exists an integer m such that a+ b= 2m. We calculate that 


(1.11) b= (a+b) —a=2m—- 2k =2(m—k). 


Since m — k is an integer (the difference of two integers is an integer), we conclude 
that b is even. This is a contradiction to our assumption. Therefore we conclude 
that a+ 6 is odd! 


0 
0 1 k-1 k k+1 


Figure 1.2.2. Visualization of an odd integer: a is odd implies that there 


exists an integer k such that a = 2k +1. 


Theorem 1.27. The sum of two odd integers is even. 


Proof. Let a and b be odd integers. By this hypothesis and Fact [I-26] there exist 
integers k and @ such that a = 2k +1 and b = 20+ 1. We calculate that 


(1.12) a+b = (2k +1) + (2041) =2(k +041). 


Since k + + 1 is an integer, we conclude from (1.12) that a + b is even. 


Exercise 1.11. Give visual proofs of Theorems [1.25] and [1.27] where the drawing 
is analogous to Figure [1.2.1] See also Figure [1.2.2] 


1.2. Even and odd integers 17 


1.2.3. Even and odd and multiplication. Observe that 6 is an even integer, 
and for every integer b, we have that 6b = 2(3b). Since 3b is an integer, we conclude 
that 6b is an even integer for every integer b. In general, we have the following. 


Theorem 1.28. The product of any integer and an even integer is even. 
Proof. Without loss of generality, we assume that the first integer is even. Let a 


be an even integer and let b be an integer. Since a is even, there exists an integer 
k such that a = 2k. We calculate that 


(1.13) ab = (2k)b = 2(kb). 
Of course, kb is an integer. So (L13) proves that ab is even. 


Exercise 1.12. Show that Theorem [1.28] may be restated as: 
If 2 divides a and if b is an integer, then 2 divides ab. 


A visual proof of Theorem [ZL.28]is given by Figure [2.3] 


b 


0 2 2-2 2(k —1) 2k 


Figure 1.2.3. A visual proof of Theorem Suppose a = 2k for some 
integer k. We see that there are an even number of total squares in the 
rectangle representing the product. Can you think of an easier visual proof by 
dividing the a x b rectangle into only two subrectangles? 


Theorem 1.29 (An odd fact). The product of two odd integers is odd. 


Proof. Let a and 6 be odd integers, by Fact there exist integers k and @ such 
that a = 2k +1 and b= 204 1. We calculate that 
(1.14) ab = (2k +.1)(2€4+1) = 2(2kl+k +2) +1. 


Since 2k€+k + £ is an integer, from this we conclude that ab is odd. 


Exercise 1.13. Give a visual proof of Theorem |L.29 


By taking the two integers to be equal in each of the previous two theorems, 
we have: 


Corollary 1.30. (1) The square of an even integer is even. 


(2) The square of an odd integer is odd. 
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Direct proof of part (1) of Corollary Let n be an even integer. By def- 
inition, n = 2k for some k € Z. We compute that 


(1.15) a? = Qk) = 2+ 2k? 


Since 2k? € Z (the product of integers is an integer), we conclude that n? is even. 


Exercise 1.14. Prove: The cube of an even integer is even. The cube of an odd 
integer is odd. 


Exercise 1.15. Prove that if the product of two integers is even, then at least one 
of the integers is even. 


1.3. Calculating primes and the sieve of Eratosthenes 


Now we return to the subject of primes, our first love. Wouldn’t it be nice to be 
able to rather easily figure out which numbers are primes? Fortunately, for numbers 
that are not too large, this is possible even if you are cast away on a deserted island 
with only food, water, pencil, and paper! 


1.3.1. A shortcut for proving that an integer is a prime. To solve Exercise 
for the prime 11 for example, you presumably checked for all divisors a with 
1<a< 11 and came up empty. Interestingly, you only need to know that 2 and 3 
do not divide 11 to show that 11 is not a prime. You will see why this is true from 
our discussion below. 


Recall the following elementary fact about inequalities: Let a,b,c be real num- 
bers and suppose that c > 0. 


(1.16) Ifb >a, then cb > ca. 


For example, if b > a, then 3b > 3a. For another useful example, suppose that a 
and b are positive real numbers. By taking c = +, we obtain: 


il 1 
(1.17) Ifb>a>0, then 25 >0. 
a 
By combining (1.16) and (L17), we obtain: 
(1.18) Ifb>a>O0andm>0, then > =. 
a 


Another result that we will find useful is: 


Fact 1.31. For every integer n > 2, there exists a prime p dividing n. 


This is Corollary [1.55] below. In particular, if n is a prime, then we simply take 
p=vn. On the other hand, if n is a composite number, then this says that n has a 
prime divisor. For example, for n = 24 we may take p = 2 or p=3. 

We now return to the unfinished business from 41.1.2.2jof showing that 127 is a 
prime. Suppose that 127 = ab, where a and b are positive integers. 
of generality, we may assume that b > a. We then get that (simply take c = a in 


(1-16)) 
(1.19) Di =mSn=e. 
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Since a? < 127, by taking square roots, we obtain 
(1.20) a< V127. 


More generally, if a? < b? and b > 0, then a < b. In this paragraph we proved that 
if 127 is the product of two (non-trivial) factors, then the smaller of the two is at 
most the square root of 127. 


From (1.20) we deduce that 
(1.21) a< V127 < V144 = 12. 


Since a < 12 is an integer, we have that a < 11. To summarize, we have proved 
that if 127 is a composite number, then 127 has a divisor a satisfying 1 < a < 11. 
In other words, if 127 does not have a divisor a satisfying 1 < a < 11, then 127 is 
prime. Indeed this is the case and we leave it to you, the reader, to check this. 


We can make the job of proving that 127 is a prime even easier. We have the 
following: 


Fact 1.32. [f 127 is not prime, then 127 has a prime divisor 1 <p< 11. Equiv- 
alently: If 127 does not have a prime divisor 1 < p< 11, then 127 is prime. 


The reason this is true is as follows. As we have seen above, if 127 is not prime, 
then it has a divisor a satisfying 1 < a < 11. Since a > 1, there exists a prime p 
dividing a (see Corollary [1.55] below). Since p divides a, we have p< a< 11. Also 
because p divides a and since a divides 127, we have that p divides 127 (by the 
“transitivity of division”). This proves Fact 


For a more general and slightly more detailed version of this argument, see the 
proof of Theorem below. 


Exercise 1.16. Prove that 11 is prime by showing that it is not divisible by 2 or 
by 3. Include a justification of why this suffices. 


Does this exact same method work for every integer strictly between 1 and 16? 
Explain why or why not. 
As we have seen in the discussion above, taking square roots is useful. 
I poured root beer into a square cup. Now I have beer. 


A mathematician translates this quote to the equation 


(Vbeer)? = beer. © 


Generalizing what we did for the number 127 above, we observe the following. 


Lemma 1.33. An integer n > 1 is composite if and only if there exists a divisor a 
of n satisfying 1<a< Jn. 


Proof. An integer n > 1 is composite if and only if there exist integers a > 1 and 
b > 1 such that ab = n. Without loss of generality, we may assume that a < b. 
This implies that a? < ab =n, so that a < \/n. 

Conversely, suppose that n > 1 is an integer with a divisor a of n satisfying 
1l<a< Jn. Since n > 1, we have 1 <a <n. By Corollary [E10] this proves that 
n is a composite number. 
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In fact, we have an even nicer statement: 
Theorem 1.34. An integer n > 1 is composite if and only if there exists a prime 
divisor p of n satisfying p < \/n. 

Equivalently, an integer n > 1 is prime if and only if for every prime p < /n, 
p does not divide n. 


Proof. Let a > 1 be an integer. By Corollary [1.55] a has a prime divisor p; that 
is, there exists a prime p such that a = pk for some integer k. 


Now assume in addition that a < \/n and a is a divisor of n. Then there exists 
a positive integer b such that n = ab. Thus, 
n=ab=pk-b=p-kb, 


where p is prime and p < kb. Therefore, p is a prime divisor of n satisfying p < \/n. 

To summarize, we have proved: If 1 < a < ./n is a divisor of n, then there 
exists a prime divisor p of n satisfying p < /n. 

Conversely, clearly if there exists a prime divisor p of n satisfying p < ./n, then 
there exists a divisor a of n satisfying 1 < a < \/n, namely a = p. 

Finally, by Lemma[I.33] an integer n > 1 being composite is equivalent to there 
existing a divisor a of n satisfying 1 < a < /n. So we are done. 


Remark 1.35. Regarding the equivalence of the two statements in the theorem 
above, we observe it is true by the following general fact, which will be proved 
in Chapter B] A statement of the form “P if and only if Q” is equivalent to the 
statement “(not P) if and only if (not Q)”. 


Theorem [1.34] allows us to efficiently decide for example if integers less than 
1024 = 2!° are prime, as we only need to see if they are divisible by the primes less 
than 32 = 2°. 


We may also use Theorem[1.34]to help find the prime factorizations of numbers. 
For example, to find the prime factorization of 2074, by Theorem we know that 
if 2074 is not prime, then it has a prime divisor at most 2074 = 45.5. Indeed, 
since 2074 is even, 2 is a divisor of 2074. Dividing 2074 by 2 yields 1037. Now, if 
1037 is not prime, then it has a prime divisor at most ¥1037 ~ 32.2. It turns out 
that 17, which is less than 1037, is a prime divisor of 1037. Dividing 1037 by 17 
yields 61. Finally, 61 is a prime, which can be verified by showing that it has no 
prime divisors at most 61. Since 61 < 8, to show that 61 is prime, we just need 
to check that none of the primes 2,3,5,7 divide 61, which is true. By combining 
all of the above, we obtain 


(159) 2074 = 2- 1037 = 2-17-61, 


where the last equality gives the prime factorization of 2074. 
Exercise 1.17. Let a and b be positive real numbers satisfying ab = 10000. Prove 
that a < 100 or b < 100. 

Hint: Can you derive a contradiction if a > 100 and b > 100? Recall that we 


used|proof by contradiction) in the proof of Lemma [L.21] 
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0 
0 1 b-1 bd 0 vn 


Figure 1.3.1. If n = ab and a < b, then a < Wn. The rectangle and the 
square have the same area n. 


Commentary: You may have to use some (elementary) rules of logic, which you 
are likely familiar with. These logic rules are also discussed more formally later in 
the book. See, e.g., Chapter Bh 
Exercise 1.18. By the proof of Lemma {1.33} we have: If n = ab, where a,b are 
positive integers, then at least one of a,b is less than or equal to \/n. 

Now, suppose that n = abc, where a,b,c are positive integers. Prove that at 
least one of a,b,c is less than or equal to Wn. 


Example 1.36. If n is a positive integer less than or equal to 100, then by Theorem 
[1.34] we have that n is prime if and only if it is not divisible by any of 2, 3,5, 7 since 
these are the primes at most 10 = /100. 

If n is a positive integer less than 49, then n is prime if and only if it is not 
divisible by any of 2, 3,5. 

If n is a positive integer less than 25, then n is prime if and only if it is not 
divisible by any of 2,3. 

In this way, it is quite easy to come up with the list of primes in (6). 


Figure 1.3.2. (276 BC-194 BC). Wikimedia Commons, Public Domain. 


Exercise 1.19. Verify that the primes less than 100 are given by the list in (L6). 


A convenient way to carry this out is to use the sieve of Eratosthenes, Namely, 
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cross out all the non-trivial (greater than 1) multiples of 2. Look for the smallest 
integer greater than 2 that isn’t crossed out, which is 3. Now cross out all the 
non-trivial multiples of 3. Look for the smallest integer greater than 3 that isn’t 
crossed out, which is 5, and cross out all the non-trivial multiples of 5. Look for 
the smallest integer greater than 5 that isn’t crossed out, which is 7, and cross out 
all the non-trivial multiples of 7. Since the smallest integer greater than 7 that isn’t 
crossed out is 11, which is greater than 100 = 10, we stop at 7. The remaining 
integers are primes (don’t count 1). (Note that in this way we have actually verified 
that 2,3,5,7 are the primes less than or equal to 10.) 


0 02 03 04 05 06 07 08 09 10 


1 12 13 14 15 16 17 18 19 20 


2 22 23 24 25 26 27 28 29 30 


3 32 33 34 35 36 37 38 39 40 


4 42 43 44 45 46 47 48 49 50 


5 52 53 54 55 56 57 58 59 60 


6 62 63 64 65 66 67 68 69 70 


7 72 73 74 75 76 77 78 79 80 


8 82 83 84 85 86 87 88 89 90 


9 92 93 94 95 96 97 98 99 | 100 


Figure 1.3.3. Cross out all of the multiples of 2, 3, 5, 7 between 1 and 100. 
Compare the integers remaining with a list of primes between 1 and 100. Are 
these two sets of integers the same? 


Exercise 1.20. Find the primes between 101 and 200 using the 
(you can read about this at the “Sieve of Eratosthenes” Wikipedia link, but 


we give a brief description below). See also Figure for the primes between 1 
and 100. In short, the sieve works by crossing out all multiples of at least 2 or more 
of the primes starting with the lowest prime first. For example, for the prime 2 we 
cross out 


100, 102, 104, 106, 108, ... , 200. 


For the prime 3, we cross out 
102, 105, 108, 111, 114,..., 198. 


(Note that there are redundancies.) Did considering primes greater than 13 help? 
Hints: See Figure and use Theorem |L.34 
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Ol | 102 03 | 104 05 | 106 O07 | 108 09 ) 110 


11 | 112 13 | 114 15 | 116 17 | 118 19 | 120 


21 | 122 23 | 124 25 | 126 27 | 128 29 | 130 


31 | 132 33 | 134 35 | 136 37 | 138 39 | 140 


41 | 142 43 | 144 45 | 146 47 | 148 49 | 150 


51 | 152 53 | 154 55 | 156 57 | 158 59 | 160 


61 | 162 63 | 164 65 | 166 67 | 168 69 | 170 


71 | 172 73 | 174 75 | 176 77 | 178 79 | 180 


81 | 182 83 | 184 85 | 186 87 | 188 89 | 190 


91 | 192 93 | 194 95 | 196 97 | 198 99 | 200 


Figure 1.3.4. Cross out all of the multiples of 2, 3, 5, 7, 11, and 13 between 
101 and 200. Compare the integers remaining with a list of primes between 
101 and 200. Are these two sets of integers the same? 


Exercise 1.21. Disprove the following conjecture (t.e., prove that the statement is 
false): For every non-negative integer n, Fy := 27° +1:= 22") 41 is a prime. You 
may wish to use some sort of computer aid (instead of working it out with paper 
and pencil). 

Remarkably, F, is known to be composite for all 5 <n < 32. (Do not try 
to prove this full statement!) Why can we characterize this as gone 
wrong? 


1.4. Division 


Primes are intimately related to multiplication and division. In this section we 
further consider division and some more of its elementary properties, and we also 
learn about some basic properties of the greatest common divisor of two integers. 


1.4.1. Factoring 1. 
“One and one make one.” — From ‘Bargain by The Who 


The rock group, The Who, were evidently thinking about multiplication. And 
what can be simpler than the multiplicative identity? The following, which we could 
have proved earlier(!), says that there is no other way to make one via multiplication 
of positive integers. So a mathematician would amend The Who’s lyrics by adding 
“uniquely”! 

Theorem 1.37. Suppose that a and b are positive integers satisfying ab = 1. Then 
a=b=1. 
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Proof. Let a and b be positive integers satisfying ab = 1. Suppose for a contradic- 
tion that a > 1. Then, since b > 1 and by (1.16), we have 


(1.23) l=a-b>a-l=a>l, 


which is a contradiction. Thus a < 1, and hence a = 1 since a > 0. This implies 
that 


1 
(1.24) p21, 


a 


Have you thought of any alternate proofs of Theorem [L.37f Figure [1.4.1] gives 
us an idea for one. 


1 i ee ee aaa “aro 2 ae ea ee oO, We ok 

a ee ee ee ee ee ee 

ees 
2 3 4 


Figure 1.4.1. Visual idea for a proof of Theorem The area of each 
colored region is equal to 1. 


Alternate proof of Theorem [1.37] Suppose for a contradiction that a > 1. 
Since + is a positive real number, we then have 


1 1 1 
(1.25) l=--a>=-1=-=b>50. 


a a a 
This is a contradiction since there is no integer strictly between 0 and 1. 


Given a result, it is usually interesting to consider special cases. In this case, 
for variety and to add a twist, we consider negative numbers instead of positive 
numbers in our special case: 


Corollary 1.38. There exists a unique negative integer n such that n? = 1. 
Namely, n = -1. 


Proof. Suppose that n is an integer satisfying n? = 1. Then |n| is a non-negative 
integer satisfying 


(1.26) bal et 


Suppose that |n| = 0. Then 0 = 0? = |n|? = 1, which is a contradiction. Thus |n| is 
a positive integer. So, by the preceding theorem, since |n| |n| = 1, we conclude that 
|n| = |n| = 1. Finally, this implies that n = —1 or n = 1. But since n is negative, 
we must have that n = —1. Since (—1)? = 1, we have proved that n = —1 is the 
unique negative integer with n? = 1. 


Here is an Alternate proof of Corollary [1.38] which implicitly assumes a 
couple of more facts but is shorter: By hypothesis, we have 


(1.27) 1=V1= Vn? = |nl. 


This implies that n = 1 or n = —1. Since n is negative, n = —1. So we are done 
since (—1)? = 1. 


Exercise 1.22. Prove that there exists a unique positive integer n such that n? = 1. 
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1.4.2. Division and its elementary properties. Recall that we say that an 
integer a an integer b if there exists an integer k such that 


(1.28) ak =b. 
We also say that b is a multiple of a. Recall that we say that a is aldivisor| of b. 
For example, 15 is a multiple of 3 since 3-5 = 15. And 3 is a divisor of 15. 


By definition, an integer n is even if and only if n is a multiple of 2. An integer 
m is odd if and only if m is not a multiple of 2. 


Figure [1.4.3] visualizes that 7 does not divide 139. 
Theorem [1.37] says that the only positive divisor of 1 is 1 itself. 


Figure 1.4.2. From these rectangles, we see that 18 is divisible by 3 and 6. 
The only other non-trivial way to factor 18 is as 2 times 9. So the positive 
divisors of 18 are Div(18) = {1, 2, 3,6, 9,18}. 


Observe that 7 divides 14 and that 5 divides 15, so the product 7-5 divides the 
product 14-15. More generally, we have the following. 


Solved Problem 1.39 (Division and products). All quantities are integers. If a 
divides b and if c divides d, then ac divides bd. 


Solution. Since a divides b and c divides d, there exist integers & and @ such 
that 
(1.29) b=ka, d= ke. 
Thus, multiplying these two integers, we obtain 
(1.30) bd = kalc = klac 


(where we used the commutativity of multiplication for the second equality). Since 
ké is an integer, we conclude that ac divides bd by the definition of divides. 


Exercise 1.23. Show that if an integer a divides an integer b and if c is an integer, 
then a divides bc. Hint: You may pattern your proof after the proof of Theorem 
[1.28] 

Another way to say this is: If a divides b and if d is a multiple of b, then a 
divides d. 


Show that, as a special case of this statement, we have: If a divides b, then a 
divides b?. 
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Exercise 1.24. Observe that 3 divides 6, and related to this, 9 (the square of 3) 
divides 36 (the square of 6). 


Generalizing this, prove: If 3 divides b, then 9 divides b?. 
Generalizing again, prove: If a divides b, then a? divides b?. 


Although the following has been observed earlier in this chapter, it is good for 
you to work it out again as an exercise. 


Exercise 1.25. Prove: If a divides b and if b divides c, then a divides c. 


1.4.3. The statement of the Division Theorem. Consider for example the 
problem of dividing 139 by 7 to get a remainder. That is, we are looking for the 
largest integer q, called the quotient, for which 7q < 139. We then define the 
remainder to be 


(1.31) r=139—7q; thatis, 139=7q+r. 
By a short calculation, we see that 
q=19 and r=6. 


Indeed, we have 139 = 7-19+ 6, whereas g = 20 would yield 7q = 140 > 139 
and hence produce a negative remainder. This can be visualized as in Figure [1.4.3 
Observe that the remainder satisfies r = 6 < 7. On the other hand, if we took q to 
be smaller, e.g., ¢g = 18, then we would obtain a remainder r > 7, e.g., in this case 
r = 13. To summarize, gq = 19 and r = 6 are the unique integers satisfying 


1389=7q+r and 0<r<7. 


T7T@eeeoeeeeneoeeeee se &@ @ @ @ 
6-eene eeeeeneeeeeeeee @ @ 
5@eeeeeeeneeseeenseeee e 
1 4@eeeoeceeaoeenseeeoeeoeevneeeeee @ 
cots 3 @ene@eeeeeneeoeeeee eee @ @ ere 
2@eeeeeeenseeeseeeeeeee e 
leeeeeeeeneeeoeeeoeeeeeee e 
12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19=q 


Figure 1.4.3. A visual rendering of the Division Theorem example that 139 = 
7q +r has quotient g = 19 and remainder r = 6. 


The general statement of the Division Theorem, which we prove in Chapter 
is the following. 


Theorem 1.40. Let a be an integer and let m be a positive integer. Then there 
are unique integers q and r such that 


(1.32) a=mq+r and 0<r<m. 


That is, if we divide an integer a by a positive integer m, then we get a unique 
quotient q and remainder r. 
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By considering the special case m = 2 of the Division Theorem and since 
0 <r< 2 means the same thing as r = 0 or r = 1, we have: 


Corollary 1.41. Leta be an integer. Then there are unique integers q and r such 
that 


(1.33) a=2q+r and r=0 or r=1. 
Furthermore, r = 0 if and only if a is even, and r = 1 if and only if a is odd. 
Example 1.42. We can use the Division Theorem to show that the integer 139 
is a prime. To see this, we just need to check that none of the primes 2,3, 5, 7,11 
divide 139, since these are the primes less than 12, and 12? = 144 > 139. We can 
see this as follows. By the Division Theorem, we may divide 139 by a prime p to 
get 
(1.34) 139 =p-qt+r, 
where the remainder r satisfies 

O<r<p. 
Then the table in Figure [I.4.4]shows that none of the remainders r is equal to zero. 
Hence 139 is prime. 


Pp q 
“9 68 4 
3/46 1 
5 |27 4 
7119 6 
tt | 19 7 


Figure 1.4.4. Table of values of primes p < 139, factors g, and remainders 
r, where 139 = p-q-+r. Since the last column has no zeroes, 139 is a prime. 


For example, the first row of Figure [I.4.4]is because 139 = 2-69+1 and0<1 <2. 


Remark 1.43. Without using the Division Theorem, one can show that 139 is 
prime by verifying that a is not an integer for each of p = 2,3,5,7,11. 


1.5. Greatest common divisor 


Given two integers, what do they have in common from the point of view of mul- 
tiplication? The answer is given by the greatest common divisor. We begin by 
discussing examples and then we give the formal definition of the greatest common 
divisor. 


1.5.1. Definition of the greatest common divisor. What do the integers 51 
and 68 have in common from the point of view of multiplication? Firstly, we 
observe that 17 divides 51 and 51 is a multiple of 17. This is because 17-3 = 51. 
By inspection, we see that the set of positive divisors of 51 is 


(1.35) Div(51) := {1,3, 17,51}. 
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Secondly, we also observe that 17 divides 68. The set of positive divisors of 68 is 
(1.36) Div(68) := {1, 2,4, 17,34, 68}. 


If we compare the lists of divisors for both 51 and 68, we see that the common 
divisors of 51 and 68 are 1 and 17. Thus, the greatest common divisor of 51 and 68 
is the number 17. We write this as 


(1.37) gcd(51, 68) = 17. 
That is, the largest integer that divides both 51 and 68 is the number 17. 


0 1 2 3 4 
0 17 34 51 68 


Figure 1.5.1. The gcd of 51 and 68 is 17: after factoring 51 and 68 by 17 we 
get 3 and 4, which are coprime. 


From the examples of 51 and 68, you may have anticipated the following. 


Exercise 1.26. Prove that a divisor of a positive integer b that is not equal to b 
cannot be more than one-half of b. 


We will find the notion of the maximum of two (or sometimes more) numbers 
useful. We embark on a brief excursion regarding the maximum. The maximum 
of two real numbers a and 0 is defined by 


a ifa>b, 

(1.38) max{a, b} := { ie BP eke 
Similarly, we define the minimum of two real numbers a and b by 

: b ifa>b, 

(1.39) min{a, b} := { a ees 


Solved Problem 1.44 (Being bigger than the bigger of the two is the same as 
being bigger than both!). Let a and b be real numbers. 


(1) Prove that max{a,b} > a and max{a, b} > b. 
(2) Prove that for every x € R, 
(1.40) x >max{a,b} if and only if («>a anda > b). 


Solution. Let m := max{a, b}. 
(1) Case i: a> b. In this case, m =a > b, som >aandm> b. 
Case ii: a < b. In this case, m = b > a, so again m > a and m > b. 


Since for every real number a and b, we have a > b or a < b, we have proved (1). 
Aside: Two elementary facts which we have used are: x = y implies x > y and 
x > y implies x > y. 

(2) (Only if) Suppose x > m. Then, by part (1) we have « > m > a and 
x >m-> b. Thus, by the transitivity of the relation >, we conclude that « > a and 
x> b. 
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(If) Suppose « >a and «> b. 

Case i: a > b. In this case, m=a<2,sox>m. 

Case ii: a < b. In this case, m = b < x, so again x > m. 
This proves (2). 


Exercise 1.27. Let a and b be real numbers. 
(1) Prove that min{a,b} < a and min{a, b} < b. 
(2) Prove that for every x € R, 
(1.41) x<min{a,b} if and only if (~<a anda <b). 
Solved Problem 1.45. Leta and b be positive odd integers greater than 1. Prove 
that 
(1.42) ab > max{3a, 3b}. 
Solution. Since a is an odd integer and a > 1, we have a > 3. Indeed, 2 is 
even, and hence not odd. For the same reasons, b > 3. Thus by (L.16), 
(1.43) ab>3b and ab>a3. 


Since we have both ab > 3b and ab > 3a, we conclude by Solved Problem 2) 
that ab > max{3a, 3b}. 


Exercise 1.28. Let a and b be positive integers. Prove that 
(1.44) ab > max{a, b}. 
Hint: Any positive integer is at least 1. 


Example 1.46 (Positive divisors of an integer). If a positive integer a divides 1575, 
then a < 1575 by Lemma[Z.8] One can also calculate (or at least verify!) that 
(1.45) ei gee ere ae 
In particular, the prime divisors of 1575 are 3, 5, and 7. One can check that the 
positive divisors of 1575 are 
(1.46) 1, 3,37 by Fe Be Bt By 3, Be 8 Be, 
T 32 th, So Be Bei 7, 8 rE Be ee 
Calculating the numerical values of this list of positive divisors yields 
(1.47) 1, 3, 9, 5, 15, 45, 25, 75, 225, 
7, 21, 63, 35, 105, 315, 175, 525, 1575. 

Observe that the integers in (1-46) are exactly of the form 
(1.48) BoP eT, 
where 0 <a<2,0<6< 2,and0<c< 1. That is, the list of integers in (1.46) 
consists of 3° - 5°. 7°, where the triples (a,b,c) are given by 
(1.49) 

(0, 0, 0), Ge 0, 0), (2, 05 0), (0, II 0), (1, 1, 0), (2; i; 0), (0, 2, 0), (1, 25 0), (2, 2, 0), 
(1.50) 

(0505 1). 05.1) (251054), (Od 1) (Lgl, 1), 250), (OD). (12.1). (2.26 ily, 
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Exercise 1.29. Can you come up with a conjecture about the relationship between 
factoring a number as a product of primes and the list of positive divisors of that 
number? 
Given a positive integer a, let Div(a) denote the set of positive divisors of a. 
Given m € Z*, let 
(1.51) Noy := {1,2,3,...,m} 
be the set of the first m positive integers. 


By Lemma|L.8} any positive divisor of a is an integer between 1 and a, inclusive, 
so that 


(1.52) Div(a) C Ng. 

Given two non-zero integers a and b, the set of common divisors of a and b 
is 
(1.53) ComDiv(a, b) = Div(a) N Div(6). 
Here, the symbol /M is denotes “intersection”. The intersection of two sets A and 
B is defined to be the set of elements that are in both sets|‘] So ComDiv(a, b) is 
the set of integers c such that c is a divisor of both a and b. For example, by 
and (1.36), we see that 

ComDiv(51, 68) = {1,17} 

since 1 and 17 are the only integers in both Div(51) and Div(68). 

We leave it to you to check that 
(1.54) ComDiv(18, 24) = {1, 2,3, 6}, 
and so gcd(18, 24) = 6. 

Observe that for all positive integers a and b, 
(1.55) ComDiv(a, b) C Na Ny = Nmninfa,0}- 
For example, ComDiv(51, 68) C Ns:. 

If one of the integers, say a, is equal to zero, then we have the following: 
(1.56) Div(0) = Z, ComDiv(0, b) = Div(0). 


In particular, for every two non-negative integers a and b, where at most one of 


them is zero, there always exists a greatest common divisor, which we denote 
by 


(1.57) gcd(a, b). 
Exercise 1.30. Show that gcd(5-17,23-17) = 17. 


A potentially divisive conversation that ends up finding common ground: 


Alpha: What do we have in common? 
Beta: I don’t know. What are your divisors? 


And the conversation continues. Can you fill in how it might go? 


®We will discuss set theory in more detail, including the notation of intersection, in Chapter [5] 
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Solved Problem 1.47 (Odd integers have nothing in common with 2 multiplica- 
tively). Let n be an odd integer. Prove that gcd(2,n) = 1. 


Solution. We have that g := gcd(2,n) is a positive integer dividing 2. Thus 
g = 1 or g = 2. Suppose for a contradiction that g = 2. Then 2 divides n, which 
implies that n is even, which contradicts the assumption that n is odd. Therefore 
g #2, so that g = 1. 


Solved Problem 1.48 (The gcd and multiples). Prove that if a is a positive 
integer, then 


(1.58) gcd(a,6) < gced(a, 30). 


Solution. Let g := gcd(a,6). Then g divides a, and g divides 6. Since 6 
divides 30, this implies that g divides 30. Therefore g divides both a and 30, which 
implies that g < gcd(a,30) by the definition of the greatest common divisor of a 
and 30. 


The property above generalizes as follows. 


Exercise 1.31. Let Z* be the|domain of discourse that is, the variables a,b,d we 


discuss below are all assumed to be positive integers. Prove that if d divides a, then 


gcd(d, b) < ged(a, b). 


1.5.2. Rational numbers. So far, we have stayed in the universe of integers. 
It is time to expand this universe! 
1.5.2.1. The definition of a rational number. We say that a real number r is a 


if it can be expressed in the form 


a 
1.59 r=, 

(1.59) : 

where a and 0 are integers. For this to make sense, we assume that b 4 0. For 
example, -i a 4 is a rational number. 


Example 1.49. (1) Integers are rational numbers. Indeed, if a is an integer, then 

we may write it as a= f. 

(2) Suppose that a = mq+r, where 0 <r <m. Then = is a rational number 
which is not an integer. Indeed, we have q< ££ <q+1. 


1.5.2.2. Fractions in lowest terms. We can understand rational numbers by 
using division. Let g = gcd(a,b). Given any rational number ¢, we define its 


b? 
to be the fraction 


a/g 

1.60 —. 
(1.60) a 
Here, we are considering a/g and b/g as integers. For example, the lowest terms of 


ey is $ since 51/17 = 3 and 68/17 = 4. Observe that gcd(3, 4) = 1. 


The following result explains why the lowest terms of a fraction has the gcd of 
the numerator and denominator equal to 1. 
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Theorem 1.50. For all positive integers a and b, we have that 
b 

(1.61) god (4,2) =i, 

where g = gcd(a, b). 


Proof. Suppose that c is a common divisor of : and B By definition, this 
means that there exist integers k and @ such that 


b 
(1.62) fen, ges 
g g 
By multiplying these equations by g, we may rewrite this as 
(1.63) (gc)k =a, (gc) = b. 


This tells us that gc is a common divisor of a and b. By the definition of g as the 
greatest common divisor of a and b, we obtain that 


(1.64) ge<g. 


Since g > 0, this implies that c < 1. This proves that the greatest common 
divisor of ; and - is 1. 


Figure exhibits the fact that 
51 3 


68 4 
Exercise 1.32. If § is in lowest terms, can both a and b be even? Explain your 
answer. 


Figure 1.5.2. Visualizing Theorem|1.50} Imagining the rational number a in 
lowest terms by dividing both the numerator and denominator by gcd(15, 20) = 
5 to obtain 3, where gced(3, 4) = 1. 


Exercise 1.33. Find the lowest terms expressions for the following rational num- 
bers: 


68 936 1748 1139163 
51’ 324’ 4199’ 987654321 - 


(1.65) 
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1.5.3. Coprime integers. Just as it is interesting when two integers have some- 
thing in common, it is interesting when two integers have nothing to do with each 
other, multiplicatively that is. 


Definition 1.51. We say that integers a and b, not both zero, are if 
gcd (a,b) = 1. 


In particular, two distinct primes are necessarily coprime. (For example, we 
have gcd(5,17) = 1.) Indeed, suppose that p and q are distinct (not equal) primes. 
Let g = gcd(p,q) be the greatest common divisor of p and q. Since g is a positive 
divisor of the prime p, we have g = 1 or g = p. Suppose for a contradiction that 
g #1. Then g = p. Since g divides q, this implies that p divides q. Since q is a 
prime and since p > 1, we conclude that p = q, a contradiction. Therefore g = 1. 


Exercise 1.34. Let p and q be non-equal prime numbers. Prove that gcd(p?, q”) = 
i 


More generally, it turns out that two integers are coprime if (and only if) they 
do not have any prime divisors in common. For example, the two integers 


2”.34-11 and 57-7°-19 


are coprime, for the prime divisors of the first integer are 2,3,11 and the prime 
divisors of the second integer are 5,7,19, and these two sets of primes contain no 
common primes. In other words, positive integers m and n are coprime means that 
if all the primes making up m disappeared from the list of all primes, then n would 
not be affected. 


Flippantly, we may say that for molecules, water H2O and salt NaCl are co- 
prime, but carbon dioxide CO and methane CH, are not coprime. © 


Theorem [1.50] says that for every two integers a and b, where at most one of 
them is zero: 


The integers a/g and b/g are coprime, where g is the gcd of a and b. 


See Chapter [4] for more discussion about coprime integers. 


1.5.4. Division and integral linear combinations. So far, we have studied some 
elementary properties of division restricted to the realm of the binary operation of 
multiplication. It is time to add the binary operation of addition into the mix. 


Regarding division, observe that 7 divides 21 and 7 divides 35. From this we 
can show 7 divides 21m + 35n for every pair of integers m,n. Indeed, let m and n 
be integers. We calculate that 


21m + 35n = 7(3m + 5n). 


Since 3m + 5n is an integer, we conclude that 7 divides 21m + 35n. 


We will find the following generalization of this rather useful. Why this is useful 
should not be immediately apparent to you! 
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Theorem 1.52. Suppose that c is a common divisor of integers a and b. Then for 
all integers m and n, we have that c divides the integral linear combination 


ma+nb 
of a and b. 
Proof. By hypothesis, there exist integers k and @ such that ck = a and cé = b. 
Hence 
(1.66) ma +nb = m(ck) + n(cl) = c(mk + ne). 


Since mk + né is an integer, by the definition of division this proves the desired 
conclusion. 


217 


Figure 1.5.3. Visualizing Theorem At each point (m,n) the inte- 
gral linear combination 21m + 35n is displayed right below it. For example, 

133 = (—3)21 + (—2)35 is displayed right below the point (—3,—2). Since 
gcd(21, 35) = 7, each integral linear combination is a multiple of 7. Horizon- 
tally we count by 21’s and vertically we count by 35’s. 


1.5.5. Example of a linear Diophantine equation. It is easy to solve linear 
equations for real numbers. What about for integers? We consider this question in 
detail in Chapter [4] For now, we just consider an example. 
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Solved Problem 1.53 (Solving a linear Diophantine equation by hand). Find 
integers m and n such that 


(1.67) 9m+11n=1. 

That is, find an integral linear combination of 9 and 11 that is equal to 1. 
Solution. Here is a simple approach. Counting by 9’s yields 

(1.68) 9, 18, 27, 36, 45,..., 


while counting by 11’s yields 


(1.69) 11, 22, 33, 44,.... 
Noticing that 45 is one more than 44, we see that 
(1.70) 9(5) + 11(—4) = 1, 
so that m = 5, n = —4 gives a solution to 9m + 1lln = 1. 


Since we were not asked to find all solutions to the equation, we just found a 
particular solution. See the following two exercises for examples of the more general 
question. 


Exercise 1.35. Verify that for every integer k, 
(1.71) m=5+11k, n=-—4—9k 


is a solution to (L67). We will see in Chapter [4] below that this is the general 
solution (that is, there are no other solutions)! 


Exercise 1.36. By trial and error, find integers m and n such that 


(1.72) 13m + 19n = 1. 


Using the above, find an integer solution to 


(1.73) 13m + 19n =5. 


Call your solution (mo, no). Is (mo +19, m9 —13) also an integer solution? Can 
you find infinitely many integer solutions? 


1.6. Statement of prime factorization 


Have you answered the question in the caption for Figure at the beginning of 
this chapter? If not, this section will help! 


1.6.1. Primes are the building blocks (dare we say atoms’) 


for the set of positive integers from the point of view of multiplication. 
It turns out that (we prove this in later chapters): 


Theorem 1.54. Any integer n > 2 may be written (uniquely) as the product of 
primes. 
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This is called the |prime factorization) of n, and this result is also known as 
the Fundamental Theorem of Arithmetic. Here are the prime factorizations of the 


integers from 2 to 19: 
2=2), 3=3', 4=27, 5=5), 6=2-3, 7=7', 
8=2?, 9=37, 10 = 2-5, 1l=11', 12=2?.3, 13 = 131, 
14= 2-7, 15=3-5, 16=2%, 17=17', 18=2-3?, 19 = 19'. 
As a reminder (we’ve referred to this result earlier), we have: 


Corollary 1.55. For every integer n > 2, there exists a prime p divisor of n. 


Proof. Let n > 2 be an integer. By Theorem|L.54} there exist primes pj, p2,..., Dk 
(where the primes may not be distinct) such that 


(1.74) n= Pi: p2°**Pk- 


So we have that p; is a prime divisor of n. 


Exercise 1.37. Find the prime factorizations of all non-prime integers 2 <n < 
100. Hint: In 41.6] we have considered n < 20. So you may start with n = 20. 
Consider the green integers in Figure Hint: We give the answer below, but 
again no peeking! 


22s 
32 | 33.| 34 


42| 43 | 44 | 45 50 
51| 52/53/54] 55/56/57) 58 | 59| 60 
61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 | 69 | 70 
71|72|73|74|75|76|77|78| 79 | 80 


81 | 82) 83 | 84) 85 | 86 | 87 | 88 | 89 | 90 
91 | 92 | 93 | 94 | 95 | 96 | 97 | 98 | 99 | 100 


Figure 1.6.1. A partially drawn visualization of the prime factorization for 
the integers 1 to 100. (The filled squares are the integers 2 through 20, as well 
as the primes and squares less than 50.) The primes are framed in thick lines. 
Note that no prime greater than 50 is a divisor of a composite number less 
than or equal to 100. 


Exercise 1.38. Find the prime factorization of 3233. You can use an online applet. 
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With the help of Figure we give a table of prime factorizations of the 
integers from 2 to 100, inclusive: 
(1.75) 
a 5) 2? 5 23 e he 3? 2-5 
11 973 1 2% 3°5 oF ‘lig 2: 3? 19 275 
327 2211 23 233 5? 2-13 33 7 29. 2-3-5 
3l Dh Sell 2-17 5«7 O23" 37 2-19 3-13 255 
41 2eark Ay 27 11 375 223 47 2°3 Vi 2.5? 
Sein (2713 53 23° Hail 277 3:19 2-29 59 273.55 
61 2-31 377 De 613 2+3+11 6F 2% 3:23 2-5-7 
71 D2 3° 73 2037 3-5? 2719 fell Q=3el3. - 79 245 
a 2-41 83 2793-7 5-17 2-43 3-29 211 89 2-375 
7-13: 2793 3.31 B-a7 5919 2°3 97 2.77 SP 11 275? 


We prove the “existence” part of the Prime Factorization Theorem[L.54] (a.k.a. 
Fundamental Theorem of Arithmetic) in §2.4.3] below. The “uniqueness” part is 


proved in §4.8] 


1.6.2. Euclid’s theorem that there are infinitely many primes. Bountiful 
primes! We can use division and a clever idea (using the power of proof by contra- 
diction!) to obtain the following result stated at the beginning of this chapter. 


Theorem 1.56 (Euclid’s Theorem). There are infinitely many primes. 


Proof. Suppose for a contradiction that there are only a finite number of primes 
in totality. Let n be the total number of primes, and denote the distinct primes by 


(1.76) P1,P2,+++5Pn- 
Let 
(1.77) Pere eer ae 


Since m > 2, by the Prime Factorization Theorem [1.54] there is some prime p,, 
where 1 <i <n, which divides m. 


Since p; also divides p,p2--- Pn, by Theorem[L.52] we obtain that p; divides the 
difference 
(1.78) M — pip2-**Pn = 1. 


Since p; > 1, this is a contradiction to Theorem Hence there are infinitely 
many primes. 


1.6.3. How to make a quick buck. Primes have a number of important real- 
world applications. And they are fun to boot! Here is a game you can play. 
Choose two really large primes p and q, and multiply them together to get pq. Tell 
your friend the integer pq and also tell them that this integer is the product of 
two primes and ask them to find the two primes. If you can find two primes large 
enough so that they can never win the game, then you have a real world application: 
The {RSA| algorithm in cryptography is based on the difficulty of finding the prime 


38 1. Evens, Odds, and Primes: A Taste of Number Theory 


t ++} t 
St 
E bee 
= ae OL 
con A og : 
o Ss x 
° 38 3 
ar | 
~ + ae 
= i ie 


Figure 1.6.2. If there are only a finite number of primes, there exists an 7 
such that p1p2---pn +1 divided by p; has remainder both 0 and 1, which is 


a contradiction. 


factorization for a large number that is the product of two large primes (called a 


semiprime). The basics of the RSA algorithm are discussed in §6.11*] below. 
If you are good at factoring semiprimes, you can make money: RSA Factoring 
Challenge, For example, the first RSA factoring challenge number is 
15226050279225333605356183781326374297180681149613 
80688657908494580122963258952897654000350692006139. 


Sorry for writing a single number on two lines; it was too long to write on one line! 
This number was factored as 


37975227936943673922808872755445627854565536638199 
x 40094690950920881030683735292761468389214899724061 


on April Fool’s Day (of all days!) of 1991 by|Arjen K. Lenstra\ 
You will make |200,000 USD) if you can find the factorization of the following 
number (if it has not already been done by the time you read this): 


2519590847565789349402718324004839857 1429282 1262040320277771378360436620 
207075955562640185258807844069182906412495150821892985591491761845028084 
891200728449926873928072877767359714183472702618963750149718246911650776 
13379859095 7000973304597488084284017974291006424586918171951187461215151 
72654632282216869987549 1824224336372590851418654620435767984233871847744 
479207399342365848238242811981638150106748104516603773060562016196762561 
338441436038339044149526344321901146575444541784240209246165157233507787 
077498171257724679629263863563732899121548314381678998850404453640235273 
81951378636564391212010397122822120720357 


This number is indeed the product of two primes p and gq. Presumably the presenters 
of the prize are the only ones who know the answer and they are bound to secrecy. 
Your challenge is to find p and q. 


One of the difficulties is current computational power. This is related to 


Moore’s Law 
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1.7*. Perfect numbers 


Question: What is the perfect number of hours to work per day? 
Answer: Six! We will see why below. 


Of course, primes are not the only natural numbers that have special properties. 
Observe that some numbers, like 36, have “lots” of divisors, whereas other numbers, 
like 39, have “few” divisors. Besides simply counting the number of positive divisors, 
we can sum the positive divisors of a number. For example, since the divisors of 36 
are 1,2,3,4,6,9,12, 18,36, the sum of the positive divisors of 36 not counting 36 
itself is “too much” in the sense that 


14+24+34+4+6+9+12+18 =55 > 36. 


On the other hand, the divisors of 39 are 1,3,18,39, and their sum not counting 
39 is “too little” in the sense that 


14+3418=17 < 39. 
When something is neither too much nor too little, we say that is perfect, so: 


Definition 1.57. We say that a positive integer n is a perfect number] if n is 


equal to the sum of its positive divisors less than n. 


For example, 6 is a perfect number since its positive divisors are 1,2,3,6 and 
(1.79) 14+2+3=6. 


Perfect! We also have that 28 is a perfect number since its positive divisors are 
1,2,4,7, 14,28 and 


(1.80) 1424447414 =28. 


Perfect again! 


Here are the first few perfect numbers 


(1.81) 6, 28, 496, 8128, 33550336, 8589869056, 137438691328. 
For example, the positive divisors of 496 are 

(1.82) 1,2, 4,8, 16, 31, 62, 124, 248, 496, 

and 

(1.83) 496 =14+24+4+8+416+431-+4 62+ 124 4 248. 


Remarkably, it is unknown if there exist odd perfect numbers. 
Conjecture 1.58 (Nicomachus). All perfect numbers are even. 


Exercise 1.39. (1) Prove that if p is a prime, then p is not a perfect number. 
(2) Prove that if p and q are primes, then pq is a perfect number if and only if 
pq = 6, 2.e., one of p,q is 2 and one of p,q is 3. 


Exercise 1.40. We know that 6 and 28 are perfect numbers. Prove that there are 
no perfect numbers n satisfying 6 <n < 28. 
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Figure 1.7*.1. Nichomachus. Line engraving by P. Ghigi after L. Agricola 


after Raphael. Wellcome Collection, |https://wellcomecollection.org/works/ 
Licensed under Creative Commons, |https://creativecommons.org/ 
publicdomain/mark/1.0| Public Domain. 


1.8*. One of the Mersenne conjectures 


It is cool to look at primes of various special forms. Remarkably, this topic is 


related to perfect numbers. A|Mersenne prime is a prime p that is equal to one 


less than a power of two. That is, a prime of the form 
p= 2” ~ 1, 


where n is some positive integer. Recall, by Theorem |L.16) n must be a prime for 
p to be a prime. For example, since 100 is not a prime, we know that 2! — 1 is 
not a prime. 


The Mersenne primes corresponding to the integers 


(1.84) 2,3,5,7, 13,17, 19, 31 


Figure 1.8*.1. {Marin Mersenne| (1588-1648). Wikimedia Commons, Public Domain. 
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are 
(1.85) 3, 7, 31, 127, 8191, 131071, 524287, 2147483647, 
respectively. 


Exercise 1.41. For n equal to the primes 11, 23, and 29, can you find the (non- 
trivial) prime factorizations of p= 2" —1? You may wish to use an online applet 
for this! 


Let q be a prime with the property that M, := 24 — 1 is a Mersenne prime. 
After the first 8 Mersenne primes in (1.85), the next 8 Mersenne primes are 


(1.86) Mei, Ms9, M107, M127, M521, Meo7, M1279, M2203. 


These integers get large quite quickly. For example, the sixteenth Mersenne prime 
M2203 has 664 digits! This particular Mersenne prime was found by |Raphael 
[Robinson in the year 1952. 


According to Wikipedia, as of October 2020, only 51 Mersenne primes are 
known. The largest Mersenne prime known, which has 24862048 digits, is 


Mgaseo933 = 2°7°59999 — 1 


(by the time you are reading this hopefully it is larger!). See the|Great_Internet_Mer- 


It has been by Lenstra, Pomerance, and Wagstaff that: 


Conjecture 1.59. There are an infinite number of Mersenne primes. 


This conjecture is at present unsolved, and fame and glory await the solver! 


We have the following beautiful result of Euclid and Euler (see Theorem [6.90] 
below for a proof): 


Theorem 1.60. 
An even positive integer n is a perfect number 
if and only if 


n = (2? — 1)2?-1, where 2? — 1 is a Mersenne prime! 


For example, the first few perfect numbers can be written as the products 
(where the first factor is a Mersenne prime): 


6=3-2, 
28 =7-4, 
496 = 31 - 16, 

8128 = 127 - 64, 


33550336 = 8191 - 4096. 


42 1. Evens, Odds, and Primes: A Taste of Number Theory 


1.9*. Twin primes: An excursion into the unknown 


Question: What did one twin prime say to the other? 
Answer: We are close, but we are not identical. 


There are certainly lots of things that we do not know about primes. This 
includes the following topic, although much amazing progress has been made. A 


is a pair of primes, one of which is equal to 2 plus the other. For 


example, the twin prime pairs of primes under 100 are 

(1.87) (3,5), (5,7), (11,13), (17, 19), (29,31), (41, 43), (59,61), (71, 73). 
The largest twin prime pair under one thousand is 

(1.88) (881, 883). 

The largest twin prime pair under one million is 


(1.89) (999959, 999961). 


As of this writing, the largest known twin prime pair is 
(1.90) (2996863034895 - 217900 _ 1, 2996863034895 - 2129000 + 1). 


This twin prime pair was found on September 14, 2016, by using the 
‘Twin Prime Search 


The following is the/I'win Prime Conjecture) As of this writing, this conjecture 


is still unsolved! 


Conjecture 1.61. There are an infinite number of twin prime pairs. 


A is a pair of primes, one of which is equal to 4 plus the 


other. For example, the cousin prime pairs of primes under 100 are 
(1.91) (3,7), (7,11), (13,17), (19, 23), (37,41), (43, 47), (67, 71), (79,83). 


Currently, it is unknown whether there are an infinite number of cousin prime 
pairs. 

Given a positive even integer k, let us say that a pair of primes is a k-prime 
pair if one of the primes is equal to k plus the other. Then twin and cousin prime 
pairs are the same as 2- and 4-prime pairs, respectively. 


Yitang Zhang proved the following[ 


“Zhang’s work depends on a number of mathematicians’ works including Bombieri, Deligne, Fried- 
lander, Goldston, Heath-Brown, Iwaniec, Pintz, Vinogradov, and Yildirim. 
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Theorem 1.62. There exists a positive even integer k less than 70 million (yikes, 
that’s a big number!) such that there are an infinite number of k-prime pairs. 


By the Polymath8 Project, this result has been improved to: 


Theorem 1.63. There exists a positive even integer k < 246 such that there are 
an infinite number of k-prime pairs. 


But, as of this writing, we do not know any specific such k, even though we 
know that such a & exists! 


1.10*. Goldbach’s conjecture 


Primes greater than 2 are always odd. So the sum of two such primes is always 
even. This naturally leads to the following question: 


Can we write each positive even integer as the sum of two prime numbers? 


Of course, 2 cannot be written as such since although 2 = 04+ 2 and 2=1+1, 
neither 0 nor 1 are prime numbers. So we should start with the number 4. As the 
first few cases, we have 


(1.92) 4=2+2,6=34+3, 8=3+4+5, 10=3+7, 12=5+7, 14=7+7. 
Here are a few more: 

(1.93) 16=3418, 18=5+4+138, 20=3+4+17, 22=3+4+19, 24=5+4+19. 
Skipping a few to 100, we have 


(1.94) 100 = 11+ 89, 102 =5+97, 104=7+497, 106 =174 89, 108 = 19489. 


A longstanding and tantalizing conjecture is the following. 


Conjecture 1.64 (Goldbach’s conjecture). Every even integer greater than 2 is 


the sum of two primes. 


Here are some more examples of even integers which are the sums of two prime 
numbers: 


(1.95) 1000 = 3+ 997 = 17+ 983 = 234+ 977 = --- = 491 + 509, 


where there are in total 28 ways to write 1000 as the sum of two primes. For each 
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of the next powers of 10, we just give one sum of primes, likely of very many: 


104 = 59 + 9941, 

10° = 11 + 99989, 
10° = 17 + 999983, 
107 = 29 + 9999971, 
10° = 11 + 99999989. 


He wane in AE Loritd go nymbros unise wade tn dae ties 
z x. owen LON, 
e eeeen, ae eae 
+ (aA AE De ne 
{ ae Jen fengse. ther Ooneett ey woe poe erage 
! vote ar 249 
fh e ge “Ets, Hie a 2 
_ ben. priteweges 
on 
- Ss ws iy sit fomats gets X. Lied rade wt facta =c. pametre (uy 
ih OG, daterminar: piifat Xx per C. et religpias conffarctes on Funch’, 
e ont, wana: poterct etiam detemiman 7 seminars weler be pile ie ae 
we geotiones ¥™ “2 Gav every | eh eG oe 


re ta 
ial fe a) 


Mews wong 
aypiosap 


Baie 47 7 laa Z vent 


s Se caneipindae tarvas cuted alfsfe x neha “i 
NR va freee cde 6 as pier iaoane: har of, 
b apphcera =e tee + ke. dite hy 

D papa ms apts prom gele Pris Zent 27 


7 =e: 


eo 


Aa Atte el nein tr eseret 
ter of 


a 4 nfow 
De 


oun, 
i * 
' 
os 
m4 
roe 
, 
Rye 


oF 


a 
Stag gs *ttig, fate 
Mescancdy y. Tar. ut 2 7742, J Ba 


Figure 1.10*.1. Letter from Goldbach to Euler stating his conjecture, dated 
1742. Wikimedia Commons, Public Domain. 
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Figure 1.10*.2. A plot of points when the horizontal coordinate is a positive 
even integer n and the vertical coordinate is the number of ways n can be 
written as the sum of two primes. This plot gives us confidence that Gold- 
bach’s conjecture is true! Credit: Wikimedia Commons, author: Reddish at 
the English-language Wikipedia, licensed under the GNU Free Documentation 
license, version 1.2 or any later version, and the Creative Commons Attribu- 
tion 3.0 Unported (https: //creativecommons.org/licenses/by-sa/3.0/deed.en) 
license. 


At the time of this writing, the following is known. 


Theorem 1.65. Every even integer greater than 2 and less that 4x 10'® is the sum 
of two primes. 


So, unless you have access to a powerful computer and lots of time on your 
hands, we don’t suggest that you look for counterexamples to Goldbach’s conjec- 
ture! 


1.11. Hints and partial solutions for the exercises 


Hint for Exercise [1.1] The color coding in Figure [1.0.3] starts as follows. 1 is 
gray. 2 is orange. 3 is green. 4 since it is 2? = 2-2 is two oranges, 5 is blue, 6 since 
it is 2-3 is an orange and a green. 7 is purple. 8 since it is 23 is three oranges. Etc. 
See §1.6]on prime factorization for more food for thought on this. 


Hint for Exercise Suppose a = n. Then n =a-b=7n-b. Now divide this 
equality by n. 
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Hint for Exercise Since we are proving an equivalence (a.k.a. a bicondi- 
tional), we prove two implications. Suppose that n = ab. 


(1) Suppose a > 1 and b > 1. Prove that 1<a<n. 
(2) Suppose that 1 <a<n. Prove that b> 1. 


Hint for Exercise Suppose that 0 divides an integer n. Show that n = 0. 
Then explain why this solves the exercise. 


Hint for Exercise Suppose that 1 <a <n. Prove that 1 < 6 < n using 
elementary properties about inequalities. 


Hint for Exercise The answer is given in each of Figure [I.1.2] and (£6). 


Hint for Exercise The second, fourth, sixth, eighth, and tenth columns 
comprise the even integers. 


Hint for Exercise We just need to check for divisors less than or equal to 
the square root of each number. 


Hint for Exercise [1.9} Our first impression is that the conjecture is false. This is 
from our knowing the falsehood of the conjectures about the primeness of integers 
of the form 2” — 1, n > 2, or even of the form 2? — 1, where p is a prime. 


Hint for Exercise Make the substitution x = 2%. Then 2” — 1 = 2° — 1 by 
2¢—1=2-—1. Apply polynomial long division to dividing 2? — 1 by x —1 (or you 
may already know what this ratio is having seen it somewhere). 


Hint for Exercise The key is to mimic Figures and 


Hint for Exercise Theorem says that “The product of any integer and 
an even integer is even.” Recast this statement as an “if-then” statement, and use 
the definition of even. 


Hint for Exercise [1.13} Draw a 2k +1 by 2€+ 1 rectangle and consider the 4 
“subrectangles” of dimensions 2k x 20, 2k x 1, 1 x 2@, and 1 x 1. 


Hint for Exercise Let a be an even integer. Apply the result that the 


product of an even integer and any integer is even to the product a® = a-a?. 


Hint for Exercise Suppose that the product of two integers a and 6 is even. 
Assume for a contradiction that not at least one of the integers a,b is even. This 
means that both a and 0b are odd. Use a result in the book. 


Hint for Exercise Use that V11 < 4. 


Hint for Exercise Assume for a contradiction that it is not true that 
a < 100 or b < 100. Then a > 100 and b > 100. Can you derive a contradiction 
from this assumption? 


Hint for Exercise we may assume that 
(1.96) O<a<b<e. 

Then show that 

(1.97) a = Hi: 
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Hint for Exercise We cross out all of the composite numbers from 2 to 
100, inclusive. Any such composite number is the product of a prime p less than 
or equal to 100 = 10 with an integer greater than 1. We call these products 
non-trivial multiples of p. These primes p are 2,3, 5,7. 

Observe that having crossed out all of the non-trivial multiples of primes, for 
primes less than or equal to p, the next uncrossed out integer is the next prime 
after p. For example, having crossed out all the non-trivial multiples of 2, 3, and 5, 
the next uncrossed out integer is 7. This in part explains why the process works. 

Having crossed out all of the non-trivial multiples of 2,3,5,7, the remaining 
integers are 1 (which is not a prime) and the exact list of primes less than or equal 
to 100. 


Hint for Exercise We only need to consider primes less than or equal 
to 200, which is less than 15. So we cross out the (non-trivial) multiples of 
2,3,5,7, 11,13. 

Hint for Exercise [1.21| Follow the advice of the exercise and check the fifth 
Use a computer and an online factorization applet. 

Hint for Exercise We simply apply Theorem[I.37]to the special case where 
a=b=n. 

Hint for Exercise Suppose a divides b. Then there exists an integer / such 
that b = ka. Square this equation. 


Hint for Exercise Suppose that 3 divides b. Then there exists an integer k 
such that b = 3k. Square this equation. For the second part of the exercise, replace 
3 by a. 


Hint for Exercise Suppose that a divides b and that b divides c. Then 
there exist integers k and @ such that 

(1.98) b= ka, c= bb. 

Continue. 

Hint for Exercise Let a be a positive divisor of a positive integer b with 


the property that a 4 b. Then a < b (why?). There exists a positive integer k such 
that ak = b (why?). We have k > 2 (why?). Continue. 


Hint for Exercise [1.27] This is very similar to Solved Problem [44] so mimic 
the proof thereof. 


Hint for Exercise We have a > 1 and b> 1. Use a basic fact about 
inequalities, multiplication, and the maximum. 

Hint for Exercise [1.29} For example, suppose that an integer n is equal to 
Dy ps? prs, where pj, p2,p3 are distinct primes and kj, k2,k3 are positive integers. 
Then the positive divisors of n are given by 


(1.99) pr pps, where 0 < ji < ky, O0< J2 < ka, 0< J3 < k3. 


Hint for Exercise Use that 5, 17, and 23 are prime to find all of the positive 
divisors of 5-17 and 23-17. 
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Hint for Exercise Let c be a common divisor of d and b. Use that d divides 
a and the transitivity of division. 


Hint for Exercise Is 2 a common divisor of both a and b? 


Hint for Exercise We have & = 4. 


Hint for Exercise Let p and q be non-equal prime numbers. Let g := 
gcd(p?,q?). Use that g is a positive integer dividing both p? and q? and that p and 
q are primes. You may use that the only positive divisors of p? are 1, p, and p?, 
and similarly for p replaced by q. 


Hint for Exercise Let & be an integer. Substitute m = 5+ 11k and 
n = —4— 9k into the expression 9m + 11n and verify that the answer is equal to 1. 
Hint for Exercise [1.36] Counting by 13’s yields 
(1.100) 13, 26, 39, 52, 65,..., 
while counting by 19’s yields 
(1.101) 19, 38, 57, 76, 95,.... 
By comparing the integers on these two lists, can you find a solution to the linear 
Diophantine equation (72)? 

For the second part of the exercise, multiply by 5. 

For the third part of the exercise, let k be any integer and calculate 
13(mo + 19k) + 19(mo — 13k). 
Hint for Exercise The answer is given by (1.75). 
Hint for Exercise One of the divisors is 53. 


Hint for Exercise [1.39] (1) The only divisors of p are 1 and p. 


(2) The only divisors of pq are 1, p,q, pg. Note that it is hypothetically possible 
that p = q. 


Hint for Exercise |1.40| By the previous exercise, we only need to show that 
the composite numbers strictly between 6 and 28 are all not perfect. To do this, 
one simply computes the sum of the positive divisors not equal to the composite 
number. 


Hint for Exercise We have that 2'! — 1 = 2047. An online applet factors 
this as 2047 = 23 - 89. 


Chapter 2 


Mathematical Induction 


Figure 2.0.1. As in the television series we are all interconnected. If 
we tip over the bear, then the koala, penguin, marmot, hippo, panda, squirrel, 
cat ,... must all follow! Figure drawn using the TikZlings package. 


Goals of this chapter: To explain how to prove results by mathematical 
induction. For example, we will prove various interesting sequences of formulas 
and sequences of inequalities. 

To show that there is a version of induction, called strong induction, that is 
more powerful in some important situations. To use strong induction to prove that 
any integer at least 2 is either a prime or a product of primes. To understand some 
properties of the amazing Fibonacci sequence. 


Mathematical induction is a very useful method of proof! 
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2.1. Mathematical induction 


Suppose that we want to prove an infinite number of statements P(n), one state- 
ment for every positive integer n. If we prove these statements one at a time, then 
things will not go very well as we will need an infinite number of proofs, which is 
not humanly possible to achieve. However, for many interesting sequences of state- 
ments, there is a method of proof, called mathematical induction, which works. 
This method is based on an axiom, called the axiom of induction. 


2.1.1. Example: An Consider the question: 
Let n be any positive integer. What is the sum of the first n 
positive integers? 


Firstly, we may write this sum as 


n 


(2.1) 1+24+--+(n-1l)+n= S74. 


i=1 


For example, by taking n = 3, we have 1+2+3=6. Apparently children learn 
the “dot dot dot” idea in grade school: Our daughter once taught us how to count 
to 100: 


one, two, skip a few, ninety-nine, one hundred! 
In mathematical notation, this is 
1,2,...,99, 100. © 


Let us look at what the sums in (2.1) are for the first several positive integers 


Table 2.1.1. Sums of the first n positive integers. 


n|123 4 5 6 7 8 9 10 
sum]1 3 6 10 15 21 28 36 45 55 


Factoring these sums as the products of two integers helps us understand if there 
is a pattern. We observe that Table 2.1.1] may be rewritten as 


Table 2.1.2. Sums of the first n positive integers rewritten. 


n|jl12 3 4 5 6 7 8 9 10 
sum|[1 3 2-3 2-5 3-5 3-7 4-7 4-9 5-9 5-11 


Notice that there is an interesting pattern in these factorizations. For n odd, the 


two factors of the sum are n and ae For n even, the two factors of the sum are 5 


and n+ 1. In both of these cases, this leads us to the following proposed formula: 


Theorem 2.1. [fn is a positive integer, then 
n(n +1) 


(2.2) Lee i=, 
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Since the right-hand side of (2.2) for 1 <n < 10 agrees with Table [2.1.2] we 
are confident that this formula is true. 


Presumably using this formula, as legend has it, young Carl Friedrich Gauss 


worked out on the spot in primary school: 


100 - 101 
(2.3) 1+2+--- don’t skip any! ---+99+ 100 = = = 5050. 


Now the question is: How do we prove formula (2.2)? In fact, there are a 
number of ways. The way we will prove it here is a more formal way, called 


mathematical induction, or just induction for short. 


We have observed that Table verifies formula (2.2) for 1<n< 10. Let us 
pretend, for example, that we don’t know that the formula is true for n = 9, but 
somehow we know that the formula is true for n = 8. Can we prove the formula for 
n = 9 from just knowing that the formula for n = 8 is true? Yes, here is a simple 
way, which is an example of the method of induction. We know that (the n = 8 
case) 

8(84+1 8-9 
(2.4) L+2434445464748= 64) _ 89 
Thus, by adding 9 to both sides of this equation, we obtain 
8-9 


Saks 


(2.5) 1424344+5+6+7+8+49 = (1+2+34+4+54+64+7+8)+9= —— 


We can put the right-hand side over the common denominator 2 to get 


8+2):9 9-(941 
(2.6) 14+243444546474849=! +) _ ot ) 
This is the desired formula for n = 9! (That is, for nm = 9; wow! Not n = 9 factorial; 
lol!) The upshot of all of this is that by just knowing that the formula is true for 


n = 8, we can prove that the formula is true for n = 9. 


Exercise 2.1. Let k be a positive integer. By just knowing formula (2.2) for n =k, 
can we prove (2.2) forn =k-+1? After all, we were able to do this for k = 8. 
Hint: The answer is given in 42.1.2] But see if you can work it out yourself! 


2.1.2. A proof by induction: Summing the positive integers. Firstly, we 
recall implications, which we briefly discussed in Chapter [1] and which is more 
formally treated in Chapter [3] So far in this book, we have been using “if-then” 
statements, which are statements of the form: 


If P, then Q. 
Another way to say this is 
P implies Q. 
And using symbols, we write this as 
P=Q. 


For our current purposes, regarding implications we will just use that proving an 
implication P = Q amounts to doing the following: 


Suppose that P is true. Prove that then Q is true. 
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Our first proof by induction is a|prototypical example} for proving sequences of 


equalities. 


Proof of Theorem [2.1] Let P(n) be the statement 1+2+---+n= 2), 


2 
Base case (n = 1): Evidently, since 1 = , we have that P(1) is true. 


Inductive step (P(k) = P(k+1)): Now, assume that k is a positive integer 
such that P(k) is true. (In the base case we have just seen that k = 1 is such an 
integer[}}) We will now show that P(& +1) is true. 

Proof: By hypothesis, k is assumed to be such that 
k(k +1) 

—_ 
Using this assumption, we calculate that (this is where the main work is) 


(2.8) 


(2.7) 14+24+--+k= 


1424---4+(k4+1)=(1424+---+k)4+(k4+1) 


k(k+1 2 
= SS + 3 bk +1) (this is where we used (2.7)) 
= ea (by the distributive law) 
(k+1)(K+1)4+1) 


oD) 


This says that P(k + 1) is true. So we have proved for all positive integers k that 
the implication P(k) => P(k + 1) is true. 

Now, the Axiom 2.5]of mathematical induction, which is formally stated below 
(see also Remark [2.3] below), is exactly the statement that in such a situation as 
this we can conclude that P(n) is true for all positive integers n. 


Remark 2.2 (Thinking forward > and backward +). A nice, but less formal, way 
to see that formula (2.2) is true without using induction is to sum positive integers 
both “forward” and “backward” as follows: 


1 + 2 + 3 +c n—-2 n—-1 n 
+n + n-1 4+ n-2 4+ «+ + 3 + 2 + 1 
n+1 4 n+1 n+1 me fo om t+1 4+ nt+itl+in4+i 


Since there are n of the n+ 1’s in the last line, we obtain that 
(2.9) 2(14+24+34---+(n—2)4+(n—-1)+n)=n(n+1), 
which is equivalent to (2.2). We guess that this must have been young Gauss’s trick 


in computing (2.3)! See Figure [2.3.1] below for a related visual proof of Theorem 


Remark 2.3. For the inductive step, we use the assumption that P(k) is true, 
(2.7), to prove that P(k + 1) is true, (2.8). That is, in the proof above, for the 
inductive step, the key was to show the implication: 


(2.10) If (2.7) is true, then (2.8) is true. 


1Note that in the base case, we didn’t assume that P(1) is true, we proved that P(1) is true. 
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Metaphorically, we are building a bridge between the two statements. See Figure 
Induction says that if the first statement P(1) is true and if the implication 
(2.10) is true for all positive integers k, then P(n) is true for all positive integers n. 


Figure 2.1.1. A (Golden Gate) Bridge between the inductive hypothesis and 
the inductive conclusion. This bridge enables us to go from San Francisco 
(P(k)) to Sausalito (P(k + 1)). Photo credit: Golden Gate Bridge, De- 
cember 15, 2015, by D. Ramey Logan.jpg from Wikimedia Commons by D. 
Ramey Logan (https://don.logan.com), under Creative Commons CC-BY 4.0 


(https: //creativecommons.org/licenses/by/4.0/deed.en) license. 


2.1.3. The general idea of induction. The concrete discussion above illus- 
trates the fundamental idea of induction. Consider an infinite sequence of state- 
ments 


(2.11) P(1), P(2), P(3),.... 


Suppose we know that P(1) is true. How can we prove that all of the statements 
are true? 


In the examples above, firstly we observed that by just knowing P(8) is true 
we can deduce that P(9) is true. Secondly, we generalized this to show that, for 
every positive integer k, if P(k) is true, then P(k + 1) is true. 

Now, if we know that P(1) implies P(2), then since P(1) is true, so is P(2). 
Next, if we know that P(2) implies P(3), since P(2) is true, we then know that 
P(8) is true. Here, we need to know that P(3) implies P(4). If so, then P(4) is 
true. And so on. We should be able to go on forever! This is the idea of induction. 


Given a particular sequence of statements P(n), n > 1, we can make a sweeping 
conjecture: No matter what positive integer k is, the implication 


P(k) implies P(k + 1) 


is true. This would mean that P(1) implies P(2), and P(2) implies P(3), ..., and 
P(8) implies P(9), and .... So all of the statements should be true since we assumed 
that P(1) is true! This is what the axiom of mathematical induction is all about. 
Fortunately, we can often prove such a sweeping conjecture. 


2.1.4. More proofs by induction: Arithmetic sums. Now, let us sum just the 
odd integers. Recall that Z* denotes the set of positive integers. 
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Exercise 2.2. Prove by induction that for alln € Z*, 


(2.12) S 0 (i-1) =143454+---+ (Qn-1) =n’. 


Hint: Model your proof on the proof of Theorem 


See Figure 2.1.2] for a visual proof of (2.12)! Although in this book we are 
teaching formal proofs, some visual proofs such as this may be considered as superior 
to induction, easier to remember, and delightful! 


Figure 2.1.2. A visual proof that 1+34+5+7+94114+134+15 = 87. This 
is the n = 8 case of (2.12). 


Let a be|pi\ and let e be So, approximately, 7 + 3.14159 and 
e © 2.71828. 


Solved Problem 2.4. Show by induction that for all positive integers n, 


n 


(2.13) dri +e)= mnt, + ne. 


Solution. Base case: n = 1. This is true because the formula says in this case 
that 
1(1 +1) 


(2.14) m-lt+e= 5 mT+1-e, 


which is true since both sides equal 7 + e. 


Inductive step: Suppose that k is a positive integer with the property that 


(2.15) din te) = mee Het he: 
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We then compute that 


k+l k 

dori +e)= dori +e) +(a(k+1) +e) 
= (a as + ke) + (a(k +1) +e) 
= MEO + (kt Inthe te 


Z EAE, L(k-+ Le. 


This is the desired inductive conclusion. By induction we are done. 


Another way to solve this problem is to use the fact that )7/,7 = aint) by 
(2.2). Indeed, using this we calculate that 


(2.16) iris aris eet tne. 


Note that we used the commutativity of addition to split the sum into two sums. 
We also used the distributive law to pull out the factor of 7 in front of the first sum 
on the right-hand side. 


More generally, we can sum any jarithmetic progression 


Exercise 2.3. Let a and b be real numbers. Prove that for alln € Zt, 


(2.17) Yo + ia) = (64a) +o + G+ na) =n (AEAa 40). 


i=l 


Observe that Exercise [2.2] is the special case where a = 2 and b = —1 and Solved 
Problem 2.4] is the special case where a= 7 and b= e. 


2.1.5. Formal statement of induction. Now, finally, we have the formal state- 
ment of mathematical induction as an axiom. 


Axiom 2.5 (Mathematical Induction). Let P(1), P(2), P(3),... be an infinite se- 
quence of statements. Suppose that P(1) is true, and suppose that P(k) implies 
P(k+1) for allk >1. Then P(n) is true for alln > 1. 


To wit, induction says that the following hypothesis implies the following con- 
clusion: 


Hypothesis: 

(1) P(1) is true. 

(2) P(k) implies P(k + 1) for all k > 1. 
Conclusion: P(n) is true for all n > 1. 


Recall that to prove the implication P(k) implies P(k + 1), we assume that P(k) 
is true. Then, by logical deduction, we need to prove that P(k + 1) is true. 
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2.1.6. The domino effect. Mathematical induction can be visualized as the 
(not to be confused with the {domino theory!). Imagine we have an 
infinite string of dominoes. We have carefully placed all of the dominoes so that for 
every k if the k-th domino in the line topples, then the (k + 1)-st domino topples 
(this is the inductive step). Mathematical induction says that if we knock over the 
first domino, then for every n, the n-th domino (eventually) topples. To visualize 
induction, see Figure2.1.3]for tipping dominoes instead of the tipping cute animals 
pictured at the beginning of this chapter. 


Induction can be used to prove the well-known formula for 


Firstly, consider summing powers of 2: 
1,14+2,1424+4,14+2+4+4+8,14+24+4+4+8+16,.... 
We easily calculate that these sums are equal to 
1, 3, 7, 15, 31,.... 


To see a pattern more clearly, we add 1 to each sum to obtain 


2, 4, 8, 16, 32,..., 


A sequence of statements. 
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Base case: First statement is true. 


Inductive step: k-th statement implies (& + 1)-st statement for every k. 


Conclusion: all statements are true. 
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Figure 2.1.3. Dominoes illustrating the idea of mathematical induction. 
Given a line of dominoes P), P2, P3,..., Pr, Prii,Pr+2,.--, if the first one 
P, falls and if the k-th one P, falls implies the (k + 1)-st one P,41 falls for 
all k, then they all (Py for all n > 1) fall. 
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which we can rewrite as powers of 2: 
Da ise es rE) eee 
Observe, e.g., that 


4 
14+24+448+416=294+2'42? 42% +24=S 2! 
1=0 


and that this sum is equal to 
31=2-1=2*t1- 1. 


This leads us to guess the formula: For any non-negative integer n, we have 
(2.18) 14242 4---42% =) 2 samt} 1, 
i=0 


Secondly, consider summing powers of 3: 
1,1+3, 14349, 1+3+9+4 27, 14+34+9+427+81,.... 
We calculate that these sums are equal to 
1, 4, 18, 40, 121,.... 
If we multiply these sums by 2 and then add 1, then we obtain 
3, 9, 27, 81, 248,.... 
We can rewrite these integers as powers of 3: 
Sr: Goer ns fers 
So we have, e.g., that ; 
14250 3' = 243 = 3477, 


i=0 
Thus, we guess that for every non-negative integer n, we have 


(2.19) SS <= 


In the above, we have guessed the formula for geometric sums with base equal 
to 2 and base equal to 3. We can actually prove, by induction, the formula for a 
geometric sum with any base r, so long as r 4 1. Of course, on the other hand, if 
r = 1, then the sum is easy: Oj.) 1’ =n +1. 


Exercise 2.4. Let r € R, where r #1. Prove that for alln € Z*, we have for a 
finite geometric sum the formula 


22 Podtrtr? t-te = 
(2.20) aor r+r r cae 


In other words, forn € Z* andr ER, 
(2.21) rth = (r-A(r® $r7 +--+ 9? 4+r41). 


We can go from finite series to infinite series by taking a limit. A formula which 
also has a geometric proof is the following. 
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Exercise 2.5. Can you prove that the infinite geometric sum of the positive powers 
of one-half is equal to one-third: 


(2.22) > ey = 3? 


n=1 


This is an infinite sum (t.e., an\infinite series), so you will have to take a limit of 


the (finite) partial sums 


(2.23) Sk i= % 3)" 


ask ow. 


Formula (2.22) may be visualized as the lovely Figure 2.1.4] 


Figure 2.1.4. A visual proof of (2.22). For every rainbow square, there are 
two white squares of the same size. If the big square is 1-by-1, then the sum 


of the areas of the rainbow squares is equal to 4 + Zz - red fee = z The 
sum of the areas of the white squares is equal to 2. 


2.1.7. Caesar is a salad dressing. Let us think more about what induction 
means. This will help us understand the key ideas and clear up any misconcep- 


tions. Seemingly unrelated, part of a conversation with in the movie 
“Bill and_Ted’s Excellent Adventure)’ went like this: 


Ted: What do I say? 

Bill (shrugging): Philosophize with him. 

Ted (shrugs; to Socrates): “All we are is dust in the wind”, dudel| 
However, let us imagine how this conversation might have continued: 

Bill: Dude, like what is induction? 


?“Dust in the Wind” is a song by Kansas. 
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Ted: It’s like the song “Another One Bites the Dust” by Queen. (In the 
dialogue below, Bill and Ted try to stay true to the lyrics of the Queen song.) 


Bill: Huh? What do you mean? 


SOCRATES: Proving the base case P(1) is like: “Are you ready, hey, are you 
ready for this?” 


Bill: Yes, I’m hanging on the edge of my seat! Or, it’s like “Start Me Up” 
(song by The Rolling Stones). Excellent! 

Ted: Then we have the inductive hypothesis P(k), which is like: “Another one 
bites the dust.” 


Bill: Dude, what do you do with this? 


Ted: You work hard to prove the inductive conclusion P(k +1), which is like: 
“Hey, I’m gonna get you too. Another one bites the dust.” 

Bill: So “I’m gonna get you too” is like proving the implication P(k) > P(k+1) 
in the inductive step. 

SOCRATES: Right, dude (Socrates learns quickly how to use the word “dude” ). 
Then, by induction we are done: “Another one bites the dust. Another one bites 
the dust. And another one gone and another one gone.” This is like the conclusion 
that P(n) is true for all positive integers n. 


Bill: That’s pretty repetitive, dude. 


Ted: It’s infinitely repetitive, dude. That is the only way to get everyone to 
bite the dust. “Are you happy, are you satisfied?” 


Bill: Yeah, I can stand the heat. You’ve convinced me! You are awesome. 
Anything you would like to add? 


Ted (as in the movie, concludes): “Yes, and that thanks to great leaders, 
such as Genghis Khan, Julius Caesar, and Socratic Method, the world is ... full of 
history!” 


With this Socratic understanding of induction, we return to the practical matter 
of proving statements by induction. Given that we have seen the induction proof 
of the formula for the sum of positive integers in can you mimic this proof 
(in a broad sense) to prove formulas for sums of small powers of positive integers? 


Exercise 2.6. Prove that for alln € Z*, 


— + 1)(2n +1) 
2.24 Dafa Pay npg 
( ) d' + n 6 


Exercise 2.7. Prove by induction that the sum of the squares of the first n positive 
odd integers is equal to 


n(2n—1)(Q2n+1)_ 4n?-n 


3 3 
Exercise 2.8. Prove that for alln € Z*, 


(2.25) eye = —12 49? 3? 4...4 (-1)"n? = (-1) 


i=l 
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Exercise 2.9. Prove that for alln € Z™, 


(2.26) ye — 13493433 4...4 ns 


i=l 


Furthermore, observe that this is equal to (1+2+3+---+n)?. 


The following formula follows from a previous result and exercise, but can you 
prove it directly by induction? 


Exercise 2.10. Prove that for alln € Zt, 


(2.27) See eee y= Mt ent) 


Let us consider sums of ratios, actually simple rational functions. 


Exercise 2.11. Prove that for alln € Zt, 


n 


1 1 1 1 1 n 
2.28 = A _ 
2) +0 iS 26° 54° “wel nei 


Exercise 2.12. Prove that for alln € Zt, 


n 


1 
(2.29) 2, (2i — 1)(21 +1) 
ee ee a: ee 1 on 
~ 153° 368° Bey *  Qn—1)(Qn+1) 7 In+1° 


Instead of sums, we can consider products. 


Exercise 2.13. Prove that if n is an even positive integer, then 


won (8) (48) (2) Eh 


Explain (not prove), without using induction, why the formula is true. 


Exercise 2.14. Prove that for all integers n > 2, 


(2.31) I (1-7) -(1- 3) (1-3) --(1-4)- 


2.1.8. Flaws in attempts at inductive proofs. We can still make mistakes in 
proving statements by induction if we are not careful. A classical flawed proof by 


induction (in this case a|falsidical paradox) is the proof that 
all horses are the same color 


The “fake proof” goes like this. (We will use some basic set theory notation. 
If you are not familiar with the notations, you may find them in Chapter 5] below.) 
We will grant ourselves the simplifying assumption that each horse has a single, 
specific color, so that we don’t for example allow for striped horses (this is not 
where the proof breaks down). 
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Ben Ben Fen eee Ben Fen a set H of k +1 horses 


hy ho hg hr Apya 
Ben Fen oe Bee Fen Hy, = H — {hy} is a set of k horses 
ho hg hy Pegi 
Bee Fe eee Bee Fen Hy = H — {ho} is also a set of k horses 
hy hg hy Api 
Fe eee x Fe the horses in H, 9 Ho all have the same color 
hg hy Pega 


Figure 2.1.5. Fake induction: the trouble is in getting started! By the induc- 
tive hypothesis, each of the sets H; and H2 (with k horses) consists of horses 
of a single color. So, since all of the horses in Hi  H2 are of the same color, 
we deduce that all of the horses in H are of the same color. But(!), for this 
argument to work, we need k+1 > 3, i.e., k > 2. In particular, this argument 
doesn’t work for k = 1 since then Hi M H2 = @ is the empty set! Credit for 
the drawing of the red horse: Designed by tartila / Freepik.com. 


Let P(n) be the statement that the horses in any set of n horses all have the 
same color. 


Base case. P(1) is trivial since we have a collection of only 1 horse. 


Inductive step. Suppose that & is a positive integer such that P(k) is true. 
Let H be a set of k +1 horses. Take one horse h, away from H to get a set 


Ay = H- {hi} 


of k horses. By the inductive hypothesis, all the horses in H; have the same color 
Ch. 

Next, choose a horse hz that is different than the horse hy. Take away the 
horse hg to get the set 

Ay = H- {ho}. 

Since Hz consists of k horses, by the inductive hypothesis again, all the horses in 
Hy have the same color Cy. Now take a third horse hg different than both h; and 
hy. We have that hg € H, and h3 € Ho. This implies that h3 has both the color 
C;, and the color C2. Therefore, C and C2 are the same color. We conclude that 
hy € Hz has the color C2, which is the same as the color C;. Therefore all of the 
horses in H have the same color C;. 


By induction, we are done! 
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Exercise 2.15. Can you detect the mistake in the flawed proof above? Hint: Think 
about how to prove P(2) from P(1). A more detailed hint is in the caption to Figure 


2.1.5) 


A different type of mistake in induction proofs stems from misunderstanding 
the meaning of “implication”. The following is an example of this. 
Exercise 2.16. Explain what is wrong with the following “proof” by induction. 

Non-Theorem. n? <n for alln € Zt. 

Non-Proof. Clearly the base case n = 1 is true because 17 = 1. 

Inductive step. Suppose (k +1)? <k+1. Then 

k?74+2k+1<k+1. 
Thus (subtracting 1 from both sides of the inequality) 
k? + 2k <k. 

This implies k? <k. We have proved that if k? <k, then (k +1)? <k+1. 

By induction we are done. 
Exercise 2.17 (Be excellent to each other (and party on, dudes!)). Explain what 


is not so excellent about the following statement and proof by induction (it’s not 
the base case). 


Statement: All adventures have the same degree of excellency. 
Proof by induction. Let P(n) be the statement: 


If A is any set of n adventures, then all the adventures in A have 
the same degree of excellency. 


Base case: Let A be a set of 1 adventure. Since A only has one element, all 
adventures in A have the same degree of excellency (as the sole element). 


Inductive step: Let k be a positive integer with the property that all the adven- 
tures in any set of k adventures have the same degree of excellency. 


Now let A be a set of k+1 adventures. Choose distinct adventures a, and ag in 
A. Then B := A—{a} andC := A—{az} each are sets of k adventures. So, by the 
inductive hypothesis, all of the adventures in B have the same degree of excellency, 
and all of the adventures in C have the same degree of excellency. 


Let a3 be any adventure in A distinct from both a, and ag. Then ag is in both 
Band C. Take any adventure in A. Then the adventure is in both B and C and 
hence has the same degree of excellency as a3. This proves that all of the adventures 
in A have the same degree of excellency. 


This completes the proof of the inductive step. 


By induction the result follows. 


2.2. Rates of growth of functions 


Just as we can prove formulas by induction, that is, equalities of different expres- 
sions, we can prove inequalities by induction. As you may have heard by now, 
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exponential functions grow faster than polynomials. We will look at examples of 
this from the point of view of induction. Through this, we will get some experience 
working with inequalities. 


2.2.1. Exponentials versus polynomials: Racing to infinity. 


Question: An exponential function and a polynomial race to infinity. 
Who gets there first? 
Answer: Neither. Both never get there! 


A bit of calculus: Consider the limit 


(2.32) lim < =0o. 


Zoo L 


Since lim; .oo e” = 00 and limy_,.. x? = 00, the limit in (2.32) has the indetermi- 


nate form &. We can apply |’Hospital’s rule (twice) to conclude that 


x x x 


ae a se a 


So, certainly, we have for x sufficiently large that 


e* > x. 


> 


Figure 2.2.1. A race/to infinity and beyond] between the functions « +> x? 
and z+ e”. 


By induction, we can determine for which integer values of x we have this 
inequality. For this, we will introduce a slight generalization of the method of 
mathematical induction. For Mathematical Induction Axiom the base case is 
n = 1, and we prove statements for all integers n > 1. More generally, we can 
assume that the base case is any given integer, for example, n = 2. In this case, for 
the inductive step we need to prove that P(k) = P(k +1) for all integers k > 2. 
Then, the slight generalization of induction allows us to conclude that P(n) is true 
for all integers n > 2. Formally, this version of induction is stated as Axiom 
below. 
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Theorem 2.6. For any n € Z*, we have the inequality 
(2.34) e“sun', 


Proof. There are various, roughly equivalent, approaches to proving this by induc- 
tion. We choose to write the desired inequality in “ratio form”: 

e” 
(2.35) oo 
That is, (2.34) and (2.35) are equivalent. We remark that two inequalities are equiv- 
alent if each inequality implies the other. That is, if either of the two inequalities 
is true, then the other inequality is true. 


Clearly 


1 


e 


so that the case n = 1 is true. (For reasons we will explain below, we will not call 
this the base case.) Now suppose that k > 1 is such that (the inductive hypothesis) 


ek 


(2.37) 7 > Is 
We observe that one way to ensure that the desired inductive conclusion is true is 
to prove the “speculative” inequality (we are hoping that this inequality is true) 


ektl ah 

(2.38) ares 

Indeed, if inequality is true, then the inductive hypothesis implies that 
ektl 

(2.39) (+12 > 1, 


which is the desired inductive conclusion. 
Now, we observe that the speculative inequality (2.38) is equivalent to 


eftl  (k+1)? a 
= = = ‘i 
(2.40) pa a (1+ z) 


By taking square roots, we see that inequality (2.40), in turn, is equivalent to 
the inequality 


(2.41) e> 1+ = 


By the aforestated equivalences and implication of inequalities, if (2.41) is true, 
then the inductive conclusion is true. Thus, it suffices to prove inequality (2.41). 
If k > 2, then we have 

1 iL —— = 
(2.42) l+7<l+5=15= /2.25 < Je. 
Hence the desired inequality (2.41) is true provided k > 2. To summarize, we have 
proved for k > 2 the implication 

ek k+1 
(2.43) =>1l > VW? 


ian) 
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Now since we could only prove (2.43) assuming that k > 2, we have to take the 
base case to be k = 2. In this case we have 


22 
(2.44) xz Sch, 
so that the base case is true. 


By the slight generalization of induction (Axiom [2.8] below), we conclude that 
for all n > 2, 


(2.45) oo le 


But we have already observed that this inequality is also true for n = 1. Thus, we 
have completed the proof of the theorem. 


Remark 2.7 (Manipulating inequalities in proofs). The proof of Theorem 22.6]has 
the following general structure. We started with what we’ll call “Inequality A”, 
which we want to prove. We showed that this follows from “Inequality B” and 
that “Inequality B” is equivalent to “Inequality C”. Then we prove “Inequality C”, 
and we conclude that “Inequality A” is true. Regarding this, the equivalence of 
inequalities allows us to reason backwards, which is essential to our proof. 


An example of where we cannot reason backwards is the following. Let x and 
y be real numbers. Suppose x > y. Then 0-2 > 0-y, which is equivalent to 0 > 0. 
Although 0 > 0 is true, in this case we cannot reason backwards and conclude that 
x > y for all real numbers « and y. 


Can you mimic the idea of the proof of Theorem [2.6] to solve the following? 


Exercise 2.18. Prove by induction that for each integer n > 4, 4" > 16n?. 


2.2.2. Induction starting at a base case other than n = 1. The proof of 
Theorem [2.6] is an example of a proof by induction where the base case is an 
integer different from 1. Here is the slightly more general version of induction we 
used above. 


Axiom 2.8 (Mathematical induction slightly more generally). Let b be an integer, 
and let P(b), P(b+ 1), P(b+ 2),... be an infinite sequence of statements. Suppose 
that P(b) is true, and suppose that P(k) implies P(k+1) for allk > b. Then P(n) 
is true for alln > b. 


This axiom, where the base case is an arbitrary fixed integer b, is equivalent to 
the usual induction axiom where the base case is the integer 1. We won’t prove this 
equivalence. Readers looking for something fun to do may prove it for themselves. 


We can also combine induction with a little bit of logical deduction to prove a 
fun result. 


Exercise 2.19 (Kung Fu Panda meets /The Penguin). Prove the following state- 


ment is true for all integers n > 2. If a total of n pandas and penguins stand in a 
line to get gelato, where the first animal in the line is a panda and the last animal 
in the line is a penguin, then somewhere in the line there is a panda directly in 
front of a penguin. 
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Remark 2.9. As usual, a hint for the exercise above is given at the end of the 
chapter. However, you may also wish to solve this exercise by reasoning (less 
formally) not using induction. Namely, in the gelato line there is always a last 
panda, and this panda is by assumption not the last animal in the line. What 
animal is behind the last panda? 


2.2.3. Comparing exponentials and polynomials a little more generally. We 
can create many exercises similar to Exercise Can you try your hand at one? 


Exercise 2.20 (Create your own problem, but don’t make it one of Jay-Z’s 99). 
Choose an\interesting positive integer|I and a boring positive integer B, where B > 
I. Using these diametrically opposite integers, find the least positive integer L such 
that: For each integer n > L, 


(2.46) LOS Die. 


Prove this statement by induction. 


We can also change the order of the monomial from 2 to any positive real 
number p. To be more concrete, we choose the base of the exponential function to 
be 2. 


Solved Problem 2.10 (Exponentials versus monomials: A general look). Let p 
be a positive real number, and let N > (2'/P —1)~! be a positive integer. Prove that 


Qn 
(2.47) 2 > ne for alln>N. 


Solution. The proof is actually not difficult because although the inequalities 
may look complicated, they have been rigged to work. 

The base case n = N is true because in this case we have 

nw _ 2” 
2 agp 

(2.48) 2 = ne : 

Inductive step: Suppose that k > N is such that 2” > 2” KP. This inequality 
is equivalent to (we put the k’s on one side and the N’s on the other side) 


kP | NP 
(2.49) 5R < 3N° 
Thus, to prove the inductive step, it suffices to prove the inequality 
(k+1)P _ kP 
(2.50) “kL = 9R° 
Indeed, since (2.49) is true, if (2.50) is true, then we have 
(k+1)? — NP 
(2.51) “okel > BN” 


which is equivalent to the desired inductive conclusion 2“+! > (k+1)?. 
Now, the inequality (2.50) we want to prove is equivalent to the inequality 


1\” k+1\? (k+1)P _ 2*t1 
(2.52) (1+;) ( ; ) ae Sage 
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This, in turn, by taking p-th roots, is equivalent to 


1 
=e OP Ty 
k —_— ? 
which further in turn is equivalent to 
1 


Since k > N and by our assumption N > (2!/? — 1)~!, we have that this last 
inequality is true. Thus (2.50) is true, and we conclude that the inductive conclusion 
is true. Finally, by induction (the slightly more general Axiom 2.8), we are done. 


Remark 2.11. Let us see how we can use the problem to rig a special case of the 
general inequality so that we can prove it by induction. 


For example, let p = 3. We compute that (2!/3 — 1)—! = 3.847, which is less 


than 4. So we may choose N = 4. We then compute that = = a = i Therefore 


a special case of (2.47) is 
1 
(2.54) 2S 7” for all n> 4, 


If Solved Problem still seems a bit too abstract, we invite you to prove (2.54) 
in a more concrete way by substituting p = 3 and N = 4 into the proof of (2.47). 


While we are looking at exponentials, we might as well prove the following fun 
fact. 


Exercise 2.21. Prove that for alln € Zt, 


(2.55) SOG 41). 2 =2-243-2 44.2 4-.-4 (mn $1). =n", 
i=l 
2.2.4. Factorials versus exponentials. As we have seen, in the race to infinity, 
exponentials beat polynomials. Now we will see that factorials beat exponentials! 
Recall that factorials are defined by 0! = 1 and n! = n(n —1)---2-1 for n € Z*. 
We will discuss factorials more formally in Example below. 


The first several factorials and powers of 2 are listed in Table [2.2.1] 


Table 2.2.1. Some positive integers and their factorials and powers of 2. It 
looks like the factorials may be winning the race to infinity. This is indeed 


true. 
ni} 2” 


1 2 
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Theorem 2.12. For any integer n > 4, we have 


(2.56) n! > 2”, 


Proof. Let P(n) be the statement n! > 2”. 
We observe that 4! = 24 > 16 = 2*, so that the base case P(4) is true. 
Observe that P(n) is equivalent to the statement Q(n) given by 


(2.57) — >i. 


For the sequence of statements Q(n), we can follow a similar pattern of proof as 
for Theorem [2.6] and we invite the reader to work this out. 

For variety (after all, it is the spice of life!) we proceed slightly differently: 
Suppose that an integer k > 4 satisfies k! > 2*; i.e., suppose that P(k) is true. 
Then we calculate that 


(2.58) (k+1)!= (k+1)- kl > (k+1)-2* > 2.2% = oF, 


where we used k! > 2” (our induction hypothesis) for the first (strict) inequality 
and that & + 1 > 2 for the second (weak) inequality, which is trivially true since 
k > 4. Hence P(k + 1) is true. 


By mathematical induction (Axiom [2.8), we conclude that n! > 2” for all 
n> 4. 


Alternate approach to the inductive step. By wishful thinking (which in 
this case works!), we speculate that the ratio 


(2.59) kw — 
is a monotonically non-decreasing (or perhaps, more strongly, increasing) function 


of k. So we consider trying to prove the speculative inequality 


(k+1)!_ ki 


By elementary algebraic manipulation, we see that this inequality is equivalent to 
the following inequality: 
! k+1 
_ (k+1)! x 2 95 
kl! Qk ; 
which is plainly true for k > 1 (with strict inequality if k > 1). Thus, if # > 1, 


k+1 


then (er > 1, and so the inductive step is proved! 


Commentary. Looking back at the proof, we see that inequality (2.58) holds 
for all k > 1. This is because we only used the induction hypothesis and the 
inequality k +1 > 2, which is true only assuming that k > 1. Therefore the 
following statement is true: 

(2.61) P(k) implies P(k +1) for all k > 1. 
We can visualize all of this as 
(2.62) PU) = Pa) = Ps) = P(4) => Pd) Se 
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Here, a green implication or statement is true, whereas a red statement is false. 
By (2.62), we have that if P(1) is true, then P(2) is true. But P(1) is false, which 
means this implication doesn’t tell us anything. It also says that if P(2) is true, 
then P(3) is true. But, similarly, P(2) is false. Eventually, we get to P(4) which 
we know is true! So, now we can conclude something, namely that P(5) is true. 


Exercise 2.22. Give an intuitive explanation for why the factorial n! grows faster 
than a” for every fixed positive integer a. 


Let a be a positive integer. Let no be an integer greater than a. Explain why 


(2.63) (2no)! > a”. 
Prove by induction that 
(2.64) (2n)! >a" for all n> no. 


We remark that Stirling’s formula gives the asymptotic behavior of n!: 
—: n 
(2.65) nl ~ Van (=) 
€ 
Roughly speaking, n! is more like n™ than a fixed base to the power n. 


Here is a sum formula involving factorials. 
Exercise 2.23. Prove by induction on n, for all positive integers n, 
1-1!4+2-2!43-3!4+---4n-nl=(n4+1)!-1. 
For more practice with inequalities, consider the following. 


Exercise 2.24. Prove by induction that for every integer n > 2, 


1, hk 1 
a ae a ee 


Can you also give a simple explanation (without using induction or integration as 
in the integral test) for why the inequality is true? 


(2.66) 


2.3. Sums of powers of the first n positive integers 


The idea of summing over positive integers can be replaced by summing over powers 
of positive integers. Let k be a positive integer. Consider the sum 


(2.67) Soa = 1h 42% 4 3h +... + nk, 

i=1 
In this section we will first give visual proofs of formulas for these sums for the 
powers k = 1,2,3. We then give a proof by induction of a formula for k = 4. 


Firstly, consider again the k = 1 case, where the sum is 


(2.68) An t= 1+2++:++n. 
Recall that adding this sum to itself in the reverse order yields 
1 + 2 eta ra 
(2.69) n + (n-1) +.--4 1 
(n+1) + (n4+1) 4+:--+ (n+4+1) 
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so we obtain 
(2.70) 2A, = 2(14+2+--:+n) =n(n+ 1). 


By dividing by 2, we obtain (2.2), which is the formula given by Theorem [2.1] We 
can visualize this algebra by arranging squares suitably as in Figure |2.3.1 


a 
oO 


Pen wre oO DN © O 


Oo 


1 2 3 4 5 6 7 8 9 10 11 


Figure 2.3.1. A picture is worth 2°53 words: since Aig = rainbow area, 
which equals the gray area, we have 2Ai9 = 10-11 is the total area of the 


rectangle. 
Secondly, for the case k = 2 we consider the marvelous formula (2.24) in Exer- 


cise 2.6} 

1)(2 1 
(2.71) GaP er ea ea en ) 
We picture k? as k? 1-by-1-by-1 cubes combined to form a k-by-k-by-1 rectangular 
solid. We fit together a 1-by-1-by-1 cube, a 2-by-2-by-1 cube, and a 3-by-3-by-1 
cube to form a mini-sculpture whose volume is equal to 17 + 2? + 37; see Figure 
[2.3.2] We can now fit three mini-sculptures together to get a mega-sculpture 
which almost looks like a cube. Figure [2.3.2]shows us how to slice the top halves of 
the top level of the mega-scupture to obtain a rectangular solid whose dimensions 
are 


(2.72) 3-by-4-by-3.5. 


3-4:(2-341) 
2 


Thus the volume of the mega-sculpture is . We conclude that the volume 


of each mini-sculpture is 

410341) 
nt 

This is precisely formula (2.71) for the case where k = 2 and n = 3. 


(2.73) 1 9 
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Figure 2.3.2. A visual proof that 12+2?+43?+.---+n? = an(n +1)(n4 3). 
Six times this sum is equal to n(n+1)(2n+1). Credit: Wikimedia Commons, 
author: Salix alba, licensed under Creative Commons Attribution-Share Alike 


4.0 International (https: //creativecommons.org/licenses/by-sa/4.0/deed.en) 


license. 


Thirdly, for the case k = 3 the formula for the sum of the third powers as stated 
in Exercise 2.9] can be rewritten as 


n 2 
8048 a. 8, 88 bn pees n(n + 1) 
(2.74) as , = 134293433 4...4n “(a 


By using (2.2), we see that this is equivalent to the formula visualized in Figure 
2.3.3] 


Fourthly, for the case k = 4 we have the formula 


(2.75) So i#=144+244344---4+n4 = —n(nt 1)(2n + 1)(3n? + 3n— 1). 


We prove this in Solved Problem below. 


Remarkably, there is a formula for the sum of the k-th powers for every positive 
integer k. We now make some comments about this, and we refer the interested 
reader to the literature for more discussion. Let B; be the j-th Bernoulli number| 
We will not go into the details about the definitions of the Bernoulli numbers except 
to say that the interested reader may consult the Wikipedia link for “Bernoulli 
number” and other sources (so don’t worry if you don’t know what they are!). 


Faulhaber’s formula] (which we will not prove) says the following: 


k 


s 1 1 k! B; ; 
2. je kt ok Oe eee 
78) di ro ee * aes" 


For example, for k = 4 we obtain the formula 


(2.77) 3 it = in? + sn + Bon? + Bgn? + Ban. 
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Figure 2.3.3. A visual proof| that 
13429433 $---+n3=(142434---4n)?. 


Wikimedia Commons, author: Cmglee, licensed under Creative Commons 


Attribution-Share Alike 3.0 Unported (https: //creativecommons.org/licenses/ 
by-sa/3.0/deed.en) license. 


Since By = a Bz = 0, and By = — 35 (the Bernoulli numbers can be computed 
using recursive formulas), we obtain 


n 


1 1 1 1 
A 5 4 3 
(2.78) 2 a 5” + ai + an 30" 


This quintic polynomial in n is factored as the right-hand side of (2.75), so indeed 
(2.75) is a special case of Faulhaber’s formula. 


Solved Problem 2.13 (Proof of Faulhaber’s formula for k = 4). Prove (2.78), 
that is, 


= 1 1 i 1 
Sa + rho n, 


i=l 


by induction. 


Solution. Here, it is convenient to use the Binomial Theorem [7.26] below, 
which says that 


(2.79) (a+b)" = ss ") a” ipi 
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For example, taking a = m and 6b = 1 we obtain for n = 2,3,4,5 that 


(2.80a) (m+1)? =m? +2m+1, 

(2.80b) (m+1)* =m? + 38m? +3m +1, 

(2.80c) (m+1)* = m4 + 4m + 6m? + 4m +1, 

(2.80d) (m+1)? =m? +5m* + 10m? + 10m? + 5m +41. 


We will use the case n = 4. As background, in (2.79), ") = _»! __ denotes the 
i 


Te i)! 
binomial coefficient n choose i. If you are not familiar with binomial coefficients, 
you may just use the fact that the coefficients of the polynomials in (2.80) are given 
by Pascal’s triangle. 


We prove (2.78) by induction on n. We will just check the inductive step as 
the base is easy as usual. Let 
ta 4 1 


(n) : + + 
— n? van n? n 
ales cal D 3 30 


denote the quintic polynomial on the right-hand side of (2.78). The inductive 
hypothesis is that m is a positive integer with the property that )7\", i4 = p(m). 
The desired inductive conclusion is that the formula =. i* = p(m +1) is true. 


By taking differences, we see that this is equivalent to 


m+1 m 
(2.81) (m+1)4 aur — Sit = p(m+1) — p(m). 
i=1 
Therefore, to prove the inductive step, we just need to show that for every positive 
integer m, (m+1)* equals p(m+1)—p(m). To verify this, we compute using (2.80) 
that 


(Gade ama 


((m b1)3 m*) 30 


=m'*+4m?> + 6m?+4m+1 
=(m+1)*. 


Since we have the desired equality, this proves the inductive step. By induction we 
are done. 


2.3.1. Relation of summing powers to integration. Let & be a positive integer. 
Define the n-th partial sum of the k-th power sum by 


(2.82) Sn = 18 42% 43% +... 40k. 
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Let 
(2.83) f(z)=a*, forx eR, 


be the k-th power function. We observe that 


(2.84) _ = . (2) (2) ++ (2) =57(2). 


Figure 2.3.4. The graph of the squaring function f(a) = x? and a Riemann 
sum partitioning [0, 1] into n = 6 equal subintervals and using right endpoints. 
The Riemann sum is approximately equal to the area under the graph, with 
equality in the limit as n — oo. 


In general, for a continuous function f : [0,1] > R, the of 
f over [0,1] using the right endpoints of a partition of [0,1] into n equal length 
subintervals is equal to 


“ i 

2.85 -th Ri — —|]A 

(2.85) n iemann sum > f (=) as 
where Az := +. By calculus, we know that 

1 Ba 1 
2.86 li -th Ri = dx = dz = ——. 
(2.86) Jim n iemann sum : f(a) dx | al 
By combining all of this, we obtain 
1 n 1 n 
(2.87) Fe tin 8 = Sie 
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Equivalently, we have the limit formula 


‘ Sn 
k+1 
Since the ratio of s, to oe tends to 1 as n goes to infinity, we will express 
this as 
“ 1 
2.89 i® = s,~ n’*1 for n large. 
(2.89) 2 eon g 


This asymptotic formula gives us more confidence in Faulhaber’s formula (2.76). 


That is, if we can prove a non-trivial consequence of a formula we are not sure 
about (e.g., a formula we haven’t proved), then we have more confidence in the 
formula. 


2.3.2. Revisiting visualizing the sum of the first n positive integers. Just like 
formal proofs, visual proofs are not unique. We often can come up with different 
visual proofs of the same result. For example, does Figure [2.3.5] convince you that 


1 
(2.90) 1424344n=(") )? 


Figure 2.3.5. The number of yellow circles, including the red circle, is equal 
to1+2+3+4---+n. There are n+ 1 orange circles, including the two red 
circles, in the last row. Each yellow circle is in one-to-one correspondence with 
a choice of two of the n + 1 orange circles. 
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Figure 2.3.6. Another rendering for n = 7 of the same visualization as in 
Figure Each yellow circle corresponds to a cardinality 2 subset of Ng. 


See Chapter [7] below for the definition of binomial coefficients. For now, we just 


n(n+1) 
2 


n+1 : 
remark that > read as +1 choose 2, is equal to , 80 we recover (2.2) 


from (2.90). 


2.3.3. Thinking about sums of squares. Can you explain why Figures [2.3.7] 
and 2.3.8] show that 


(2.91) 3(124 2? 43? 4 ea pen?) = (nt 10+ 2 494-40)? 
Observe that by applying (2.2) to this, we recover (2.24). 


Exercise 2.25. Can you complete from (2.91) a visual proof of the formula in 
Exercise [2.6]? 
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Figure 2.3.7. The sum of the integers in each of the three equilateral triangles 
equals 17 + 274 32 4+---+4+n?. 


Figure 2.3.8. Each position in the three triangles sums to 2n + 1. Moreover, 
there are 1+2+3-+----+ 7 positions in each triangle. 
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a, a2 a3 a4 a5 ag 


0 fi fe cE Sa fs Ie 
Figure 2.3.9. A visual explanation for why Exercise is true. 


2.3.4. Thinking about proving summation formulas by induction. In the so- 
lution to Solved Problem we employed a special case of the following general 
method for proving summation formulas. 


Exercise 2.26. Let {a,}°, be a sequence of real numbers, and let f : Z* — R be 
a function. Suppose that a, = f(1) and that for allk € Zt, 


(2.92) ans. = f(6+1) — f(h). 
Prove by induction that the formula 
(2.93) a, tag +a3 +--++4, = f(n) 


is true for alln € Zt. 


Can you reprove some of the induction exercises we have seen thus far by this 
method? 


2.4. Strong mathematical induction 


Now that we are comfortable with mathematical induction, we can think about 
improvements. Strong mathematical induction is a form of induction, which in any 
circumstance is at least as powerful as the mathematical induction we have thus 
far considered. 


2.4.1. Strong mathematical induction. We first describe strong mathematical 
induction in terms of the domino effect, and then we give a more formal presenta- 
tion. 


The strong induction axiom can be described as follows. Firstly, we assume, 
as before, that the first domino has been toppled. So the idea of the base case has 
not changed. But we will consider a different inductive step. Suppose we have, 
perhaps not as carefully as before, arranged the dominoes so that we have the 
following property. If the first & dominoes have all toppled, then the (k + 1)-st 
domino topples (this is the strong inductive step). Strong mathematical induction 
says that, under the assumptions above, for every n, the n-th domino eventually 
topples. 


We can think of strong induction allowing us to use more dominoes to topple 
the “next” domino. For example, changing from dominoes to Dalmations, if we 
want to topple the 101-st Dalmation, then by the strong inductive hypothesis, we 
have all of the first 100 Dalmations at our disposal to use for toppling Dalmation 
number 101! Note that we don’t always have to use all of the Dalmations at our 
disposal. For example, hypothetically, we might be in a situation where we only 
need to use Dalmation numbers 99 and 100 to topple Dalmation number 101. As we 
will see later, this is sometimes the case for results about the “Fibonacci numbers”. 
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In analogy to Axiom [2.5] formally, we may state the mathematical induction 
axiom as: 


Axiom 2.14 (Strong Mathematical Induction). Let P(1), P(2), P(3),... be an in- 
finite sequence of statements. Suppose: 

(1) P(1) is true. 

(2) If k € Z* has the property that P(1), P(2),...,P(k) are all true, then 
P(k +1) is true. 

Then P(n) is true for alln > 1. 


Figure 2.4.1. For the inductive step in strong induction, we suppose that all 
of the first k dominoes P;, P2,..., P,y—1, Py, have fallen and we need to prove 
that this implies that the (k + 1)-st domino P;,+1 falls. 


Hy 


2.4.2. Summary of strong induction. We can think of an infinite sequence of 
statements as a For mathematical induction, we are saying 
that if the first step exists and if we know the implication that the (k + 1)-st step 
exists provided the k-th step exists, then the whole Stairway to Heaven exists. 

For strong induction we are saying the following. If the first step exists and if 
we know the implication that the (k + 1)-st step exists provided the first & steps all 
exist, then the whole Stairway to Heaven exists. Figure 2.4.1] gives a visualization 
of strong induction. 


2.4.3. Existence part of the statement of the Prime Factorization Theorem. 
A wonderful application of strong induction is the following “existence” part of the 
Prime Factorization Theorem. 


Theorem 2.15. For any integer n > 2, n is prime or n is a product of primes. 


Proof. Let P(n) be the following statement: n is prime or n is a product of primes. 

Base case: P(2) is true because 2 is a prime. 

Strong inductive step: Suppose that k > 2 has the property that each 
of 2,3,...,k is a prime or is a product of primes (this is the strong inductive 
hypothesis). 

Case 1. k +1 is a prime. Then we are done. 

Case 2. k +1 is not a prime. By this assumption, there exist integers a and 
b satisfying k+1=a-band1< a,b < k+1; that is, 2 < a,b < k. By the 
strong inductive hypothesis, we know that a and b are each a prime or a product 
of primes. Hence their product a-b = k +1 is a product of primes. Figure [2.4.2 
gives a visualization of the inductive step for Case 2. 
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Figure 2.4.2. For the strong inductive step, we assume that all of the integers 
between 2 and k are products of primes. If & + 1 is not prime, then k + 1 = ab, 
where 2 < a,b < k. By the strong inductive hypothesis, both a and b are 
products of primes and hence so is their product k +1. (Here, for brevity’s 
sake, we have included in the meaning of being a product of primes the case 
of being a prime.) 


In either case, we have proved the desired strong inductive conclusion that k+1 
is a prime or a product of primes. 

By strong mathematical induction (Axiom [2.14), we have proved that P(n) is 
true for all n > 2. 


Remark 2.16. Another application of strong induction, to prime numbers, is 
Euclid’s bound for the n-th prime number in Theorem [4.15] below. 


2.4.4. Primes and the greatest common divisor. Primes have nice properties 
with respect to the greatest common divisor. We will be able to soup up properties 
by using induction. Firstly, we observe the following property. 


Proposition 2.17 (Commonality with a prime is an all-or-nothing proposition). 
If p is a prime and if a is not a multiple of p, then 


gcd(a,p) = 1. 
In other words, if p is a prime and if a is an integer, then 
(a is a multiple of p) or ged(a,p) = 1. 


In particular, if p and q are distinct primes, then gcd(p, q) = 1 (we proved this last 
fact in 41.5.3). 


Proof. Suppose that p is a prime and that a is not a multiple of p. Since p is a 
prime, its only positive divisors are 1 and p. Since gcd(a,p) is a positive divisor 
of p, we have gcd(a,p) = 1 or gced(a,p) = p. Suppose gced(a,p) = p. Then, since 
gcd(a,p) divides a, we have p divides a, which contradicts our hypothesis. We 
conclude that gcd(a, p) = 1. 

Note that if p and q are distinct primes, then neither can be a multiple of the 
other. Thus ged(p, g) = 1. 


For example, if M is the set of multiples of, say, the prime 7, that is, 
M ={7k:k €Z}, 
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then gced(n,7) = 7 for all n € M. On the other hand, by Proposition [2.17] we have 
that gcd(n,7) = 1 for alln € M. (“e” means “is an element of”, and “¢” means 
“is not an element of”. See Chapter [5]below for every set theory notation you have 
not seen before.) 

Can a prime have anything to do with the positive integers smaller than it, 
even if we multiply all of them together? The answer is provided by: 


Corollary 2.18. If p is a prime, then 


(2.94) gced(p, (p — 1)!) = 1. 
Proof. By Proposition [2.17 

1 = ged(p, 1) = ged(p, 2) = ged(p, 3) = --- = ged(p,p — 1). 
Thus, by Exercise 2.27] right below (which is proved by induction), we have 
(2.95) gcd(p, (p — 1)!) = ged (p,1-2-3---(p—1)) =1. 


For example, without having to do a calculation, we know that gcd(97, 96!) = 1. 
By the way, 96! ~ 101°°, so it would be hard to verify this fact without using the 
corollary! 
Exercise 2.27. Observe that Lemma [4.29] below says that if gcd(a,b) = 1 and 
gcd(a,c) = 1, then gcd(a, bc) = 1. For this exercise, we will boldly assume this fact. 


Let a and by, b2,..., by be non-zero integers such that gcd(a, bj) = 1 forl<i< 
k. Prove by induction that 
(2.96) gcd(a, bib ose br) =i), 


Hint: by -+- bea = (b1 «++ by) + Bega. 


2.4.5. Numbers of tilings by dominoes of a rectangle. Dominoes, besides 
being useful for toppling, are also useful for tiling. Let n € Zt. Consider the 
sequence of positive integers {t,}°2, defined by: 


tp is equal to the number of tilings of an n x 2 rectangle by 1 x 2 
rectangles (which we can think of as dominoes). 


We'll call {t,,}°2, the sequence of tiling numbers. Clearly, t; = 1, and a moment’s 
thought gives us that tg = 2. We can draw some pictures to compute t,, for n small. 
Figure [2.4.3] shows how to compute fj, ta, t3, and t4. 


We have recursive formulas for t3 and ta: 
Exercise 2.28. Using Figure |2.4.4| explain why we have the following formulas: 
(2.97) tj =ti +te and ty=tot+ts. 

And we have recursive formula for ts: 


Exercise 2.29. Verify that ts = t3 +t4. Hint: Mimic the idea presented for 
tz = ty + to and ty = to +t 3 in Figure 244] The secret to seeing all of these 
recursive formulas is to categorize tilings according to whether the rightmost tile(s) 
are vertical or horizontal! 


Exercise 2.30. Can you guess a recursive formula for t, for all integers n > 3? 
Hint: If you are not sure, the topic of the next section should give the answer away! 
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n=1n=2 n=3 n=4 


Figure 2.4.3. These arrangements convince us that tj = 1, te = 2, tz = 3, 
ta = 5. 


Figure 2.4.4. A visual proof that tz = t1 + tg and t4 = te + ts. 


2.5. Fibonacci numbers 


Fibonacci numbers have numerous applications, for example, the Fibonacci search 
technique in computer science. In this section, we'll learn about these important 
numbers. 
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Figure 2.5.1. [Fibonacci] (1170-1250). Wikimedia Commons, Public Domain. 


The form a sequence of positive integers. Let us denote 
the n-th Fibonacci number by f,,. We define the first two to be one: 


(2.98) fi = fo =1. 
The rest of the Fibonacci numbers can now be defined by induction: 
(2.99) fn = fn-2+fr—1 for n> 3. 


We see that fs = fit fo=1+1=2, fa = fot fs =1+2 =3, and so forth. 
The first several Fibonacci numbers (those less than 1000) are 


(2.100) 1,1,2, 3, 5,8, 13, 21, 34, 55, 89, 144, 233, 377, 610, 987. 
The next several Fibonacci numbers (those less than 1000000) are 


1597, 2584, 4181, 6765, 10946, 17711, 28657, 46368, 
79025, 121393, 196418, 317811, 514229, 832040. 


Observe how quickly Fibonacci numbers grow! 
For further discussion of inductive definitions, see §2.6] 


If you have solved Exercise 2.30] then you will see that the Fibonacci numbers 
are equal to the tiling numbers: f, = tn—1 for n > 2. 


Historical note. Fibonacci numbers appear in Indian mathematics and in par- 
ticular in the work of 


2.5.1. Patterns in the Fibonacci sequence. The Fibonacci numbers exhibit 
many interesting patterns. Sometimes we can find these patterns in the Fibonacci 
sequence by methodically applying simple ideas. For example, let us start with a 
Fibonacci number and “push it down”: 


fs = fa + fs 
= f3+ fst fe 
=fatfetfrth 
=fgt+fetfctl. 
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Shall we see how to do this for the next Fibonacci number? Similarly, we compute 
that (using a different color coding of the Fibonacci numbers) 


(2.101) fe=fst+fa 
= fat+ fat fa 
= fat fa t+ fat fo 


=fat+ftfetfothi 
=fat+ft+fetfAti. 


Figure [2.5.2] shows a way to see the calculation above visually. 


fe| = | fs| fa 


fa |e 
fa 


fa| f3| f2 
fs 


fa| fs| fal fi 


= fa|f3|f2| fil 1 


Figure 2.5.2. A visual rendering of the calculation in equation (2.101). 


As you may expect, the formulas above are part of a general pattern, given by 
the following. 


Solved Problem 2.19. Let f,, be the n-th Fibonacci number. Prove that for all 
positive integers n, 


(2.102) So fi= fit fot fate + fn = Inte — 1. 
41. 


Solution. Base case: n = 1. We have 


So f= fi =1=2-1=fgs-1=fire—1. 


So the base case is true. 


Inductive step. Suppose that k is a positive integer satisfying 


k 
(2.103) So fi = frs2 — 1. 
t=1 
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Using Spears Te pam fi) + fr+i, the inductive hypothesis, and the Fibonacci 
recursive definition, we compute that 


k+1 k 
yh (>: i) sie 
4=1 


i=1 
= froo—-1t frst 
= fri3—1 

= F(k4+1)42 1, 


This proves the inductive step. By induction we are done. 


The Fibonacci numbers satisfy a plethora of identities. Here are two more: 


Exercise 2.31. Prove that for all positive integers n, 


(2.104) fit fat fate + font = Jon 
Exercise 2.32. Prove that for all positive integers n, 
(2.105) fot fat foet--++ fon = fonti — 1. 


The Fibonacci numbers satisfy nice divisibility properties. 


Exercise 2.33. Prove that for every k € Zt, the 3k-th Fibonacci number f3p is 
even. Hint: If you are having difficulty, see the following solved problem. 


Solved Problem 2.20. Prove that for every k € Z*, the 4k-th Fibonacci number 
fax is divisible by 3. 


Solution. We have that f, = 3, which of course is divisible by 3, so the base 
case is true. 


As the inductive hypothesis, suppose that k is a positive integer for which 4; 
is divisible by 3. We then compute that (we go, step-by-step using the Fibonacci 
recursive definition, from 4k + 4 back to 4k) 

facet) = Santa 
= farsa t+ farts 
= farsa + (farsi + far+e) 
= fargi + 2far+e 
= fara + 2(far + far41) 
=2 far + 3 farsi. 


Now, since /4; is divisible by 3 by our inductive hypothesis and certainly 3 is 
divisible by 3, their integral linear combination 


(2.106) (2) fax + (fan+1)3 


is divisible by 3. We conclude that f4(,41) is divisible by 3. This proves the 
inductive step. 


By induction, we are done. 


Exercise 2.34. Prove that for every k € Zt, the 5k-th Fibonacci number fs, is 
divisible by fs. 
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The following exercise, although a bit more difficult, is based on a similar idea. 


Exercise 2.35. Prove that for every k,€ € Z*, the ké-th Fibonacci number fre is 
divisible by both fy and fe. 


Hint: You may for example prove this by induction on €. You may wish to 
write out explicitly the statements P() that you will prove. 


The Fibonacci sequence has interesting patterns regarding sums that are qua- 
dratic in the Fibonacci numbers. 


Exercise 2.36. Prove for alln € Z* that 


(2.107) WOR HA+H+BR + +h = fofo 
t=1 


Figure 2.5.3. An illustration of formula (2.107) in the special case that n = 8: 
1241? 42? 437 +57 +8? +13 + 21? = 21-34. 


Déja vu all over again? This illustrates a rather convincing argument for 


(2.107), would you not say? 


A more challenging formula, which is quadratic in the Fibonacci numbers, is: 
Exercise 2.37. Prove that for all m,n € Zt, 
(2.108) Smtn = fm-1fn + fmfn41- 
Hint: You may for example prove this by strong induction on n. 
Exercise 2.38. Prove that for all positive integers n, 
(2.109) fa + fata = font 
Hint: You may use (2.129) below. 
For sequences, the index is often a positive integer. For the Fibonacci sequence, 


we can naturally extend it so that the index is any integer, positive, zero, or nega- 
tive! 
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Exercise 2.39. Recall that we defined fo = 0. We can extend the Fibonacci se- 
quence to negative indices by defining for negative integers n, 


(2.110) tin = fn+2 —_ Ina 


(Observe that this formula is actually true for all n € Z since it simply says that 
fn oF fn41 = faa) 
We also obtain that 


(2.111) foo), fs] 1, feo fs, ee 
And we compute that f_11 through fo are given by 

(2.112) $0, 255 Se 0, 18, a a 
Prove the forn € Z*, 

(2.113) fen = (-1)"*" fa. 


Hint: Use strong induction. 


The following two exercises also use strong induction. 
Exercise 2.40 (Exponential bound for Fibonacci numbers). Prove that 


(2.114) fn <2” for n>1. 


Exercise 2.41. Prove by strong induction that every positive integer is the sum of 
distinct Fibonacci numbers. 


For example, 1 = f; (or fo), 2= f3,3 = fa,4= fitfa, 5 = fs,6= fatfatfa. 


Hint: Ifn is not a Fibonacci number, then let f be the largest Fibonacci number 
less than n, and observe that 


(2.115) n= f+(n—f). 
Can you prove that f >n—f? Apply the strong inductive hypothesis to n — f. 


2.5.2. Binet’s formula. Miraculously, there is an explicit formula for the Fi- 
bonacci numbers: Binet’s formula, which looks like it might be hard to prove, 


but, with a bit of care, turns out to be relatively easy. 
Theorem 2.21. Let 


en 1S 1-75 


(2.116) 3 o, B= <0. 
For each n € Z*, the n-th Fibonacci number satisfies the formula 
a” — Br 
(2.117) fn = 
That is, 
(44) - (4) 
(2.118) (ee oes : 
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Proof. Observe that a and £ are roots of the quadratic polynomial 


(2.119) ee ee ee Te 
that is, 
(2.120) a? —a-—1=0, p?-Bp-1=0. 


This explains why a+ 8 = 1 (i.e., the average of a and £ is $) and also why a and 
6 are special. 


The number a is called the 


Let the statement P(n) be the formula f, = Sad “. A calculation yields 


(438) (254) (334)° (258) 
(2.121) (2 Me . 

, V5 v5 , 
Thus P(1) and P(2) are true. These two statements form the base case. 


Now suppose that & > 2 has the property that P(k — 1) and P(k) are true. 
Then 


qk-t_ a ak _ ge 
2.122 ies = 
(2.122) Se-1 ys fr oe 
We want to prove that P(k +1) is true. By these two formulas and the inductive 
definition of the Fibonacci sequence, we compute that 


(2.123) ag a 
(Bh = I a BK 
~ = | V5 
= 1 (o**(1 +0) + B*-1(1 + 8)) 
=Ze(a R192 4 gh1g2) (by (IO) 
1 


= VE a + p=) : 


This proves that P(k + 1) is true! 
By strong mathematical induction, we conclude that P(n) is true for all n € 


rae 


2.5.3*. Another proof of Binet’s formula. We may view the inductive formula 
frso = froitf, for k > 1 for the Fibonacci numbers as a vector equation for column 
vectors of consecutive Fibonacci numbers: For k > 1, 


Sr+2 1) a fei 
2.124 = ; 
re ( ae i Oe) \ 
Indeed, the vector equation (2.124), using matrix multiplication on its right-hand 


side (take the dot products of the rows of the 2 x 2 matrix with the column vector), 
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says that 
fe+2 = froit fe 
Seta = frti, 
where the second equality is tautological (obvious). 


We may group two consecutive pairs of consecutive Fibonacci numbers together 
to get the 2 x 2 matrix equation 


fet2 Sets 1 1 Seti fre 
2.125 = 
a Gre 1 0)\ fe fe 
for k > 1, where fo = 0 by definition (so that fz = fo + f1). This matrix equation, 
like the vector equation (2.124), encodes the Fibonacci recursive definition. 


For k = 1, (2.125) says that 
i Je) 7 24 fe fr f 2d ° 
fa fi 1 0 fi fo Op: 
For k = 2, (2.125) says that (we use the previous display) 
(e a © le fae a 
fs fa 1 0 fo fi 1. Oy 
So one expects that the following formula is true: 
k 
‘ee )-G 0) 
2.126 = ‘ 
( ) ( fe fk-1 1 0 


Indeed, we can prove this formula by induction. We leave such a proof to the reader 
as an exercise. 


Exercise 2.42. Prove (2.126) for all positive integers k, where the 0-th Fibonacci 
number is defined to be fo = 0. 


With the help of some elementary linear algebra, we can use the matrix for- 


mula (2.126) as follows. The eigenvalues \ of the matrix ( ; ; ) satisfy the 
characteristic equation 


1-A 1 
i ah 
=(1-—A)(-A)-1-1 
= \-r-1, 
This is the familiar equation (2.120)! 
1 5 1-—¥V5 
Soa= == and 6 = tie are the eigenvalues of ( ; : ), The eigen- 


value a has ( ‘i ) and the eigenvalue 6 = 1— a has eigenvector 
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( : ). That is, 
-—a 
1 oi a\ | a 
1 0 a eas ee 
Thus, using (2.126), we calculate that 
fast fe a ae oa eo eee 
te fe-1 1 1 0 uf 
The second row of this equation says 


ofc + fei = a*, 


me © 
So 


so that 
(2.127) a? fy tafe = art. 


Similarly, we also have 


Cre Gy ees ae. 
te fe-a —a 1 0 —a —a )- 
The second row of this equation says 
(2.128) fr — Ofx—1 = —B*a. 
So, by adding and (2.128) together, we obtain 
(a7 +1) f, = a**1 — B¥a = a(a* — B*); 
that is, 


_ (,k k a _ oF — oF 
fe = (a a rag V5 ’ 


where for the second equality in the display above we used that 


ae ue nae 


e+1= 


This reproves Binet’s formula. 


Exercise 2.43. Prove that for all integers n > 2, 
(2.129) frtifn—1— fa = (-1)”. 
Hint: Determinant! 


Recall that Z2 denotes the set of non-negative integers. We remark that, more 
generally, the Vajda—Everman identity says that for i,j,n € Z2, 


(2.130) Sntifn+s — Infn+i+g = (-1)" fi. 
But we do not ask you to prove this! 
Remark 2.22. Figure [2.5.4Jis the basis for another solution to Exercise [2.43] 


Exercise 2.44. Prove for alln € Z* that 
2n—-1 


(2.131) S- fifisa = fifo + fafa + fafa t+++ + fon—1fon = fin: 
i=1 
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fn—2 fad 


fn4i 


il 
I 
i) 
fn-1 1 
| 
I 
1 


Fn-1 fn fn+1 


Figure 2.5.4. Can you deduce from considering the areas of the rectangles 
in this picture the recursive identity: fn4ifn—1 — f2 = fa 5 fnfn—2, Or 


equivalently, fn+ifn—1 — f2_1 = f2 — fnfn—2? 
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© 
© 
©) 
©) 
©) 
©) 
©) 
©) 
©) 
© 


OOS © &9 es) @9 (2) @) GY @) 


Figure 2.6.1. The diagonal sums of numbers in |Pascal’s triangle equal the 


Fibonacci numbers: Summing diagonally the numbers in circles of the same 
color yields: 1 (red), 1 (orange), 2 (yellow), 3 (green), 5 (blue), 8 (purple), 
13 (red), 21 (orange), 34 (yellow), 55 (green), 89 (blue). See Chapter [7] for 
a discussion of binomial coefficients and Pascal’s triangle. Can you find the 


proof, or anywhere? Spoiler alert: see {7.4.5]below. 


In this section we discuss |recursive definitions (a.k.a. inductive definitions), of 


which the Fibonacci sequence is an example. 


Example 2.23 (Non-negative powers of real numbers). Recall that the k-th power 
of a real number b is defined by 
(2.132) bF := b-b-- +8. 
S~__ 
k times 
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Using inductive definition, we can make this definition more formally rigorous. 
Let 6 be a real number. Define the function f : R — R by 


f(x) = ba. 


This is simply the linear function given by multiplication by b. As we will see by 
Theorem[2.24] below, there exists a unique function A: Z7 > R satisfying A(0) = 1 
and the recursion relation 


(2.133) A(k +1) =b- A(k) 
for all k € Z2. The k-th power of b is formally defined by 
(2.134) b® := A(k) for k € Z. 


From A(0) = 1 and (2.133), we see that 
P=1, b=). =b-1=b, P=d-F'=b-b, B=b-b?=b-b-5, 
etc., in agreement with (2.132). 


By mathematical induction, we can prove the following, which we used in the 
example above. 


Theorem 2.24 (Recursion theorem). Let X be a set (if you prefer to be more 
concrete, you may take X to be the set of real numbers R) and let f : X > X bea 
function. Then, for every element a, € X, there exists a unique function 


(2.135) A:Zt 3X 
satisfying 
(1) A(Q1) = a1, 
(2) A(k+1) = f(A(k)) for allk € Z*. 
If we denote a, := A(n) for n € Zt, then we have obtained by recursive 
definition a sequence (actually, by Definition [5.32] below, A is already a sequence) 
(2.136) {an}, = {a1, a2, a3,...}. 


Remark 2.25. In the theorem above, we can replace the domain Zt by the set 
Z2, with the obvious resulting changes, e.g., A(0) = ao. 


Remark 2.26. Theorem |2.24|actually defines the iterates of the function f : X —> 
X. Namely, for each k € Zt, 


(2.137) fF: X AX 


is defined by f*(a1) := A(k) for every aj € X. In particular, f'(ai) = f(a1), 
f?(a1) = f(f(a)), ete. 


Example 2.27. Suppose that we choose instead a9 = A(0) = 3, and we define 
f:R-R by f(z) = §. Let ay, = f*(ao) for k € Z*, where f° := Ip is the identity 
function. Then 


3 
(2.138) @,=-— for keZ=. 
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Proof of Theorem Existence of A. Let S Cc Z* be the set of n such that 
there exists a function A, :N, := {1,2,3,...,n} > R satisfying 
(1) An(1) = a1, 
(2) An(K+1) = f(An(k)) for all l<k<n-1. 
We will prove by induction that S = Z*. 
Base case. Clearly 1 € S since we may define A; : N; = {1} > R by 


A,(1) =a. 
Inductive step. Suppose that m € S. We define Ajm4i:Nm41 > R by: 
(i) Am4i(k) = Am(k) for 1 <k <m. 
(ii) Am+i(m + 1) = f(Am(m)). 
Since m € S, we have 
(2.139) Am+i(1) = Am(1) = az. 
Using also that m € S and (i), we have 
(2.140) Amaa(k +1) = Am(k +1) = f(Am(&)) = f(Ams(B)) 


for 1 <k <m-—1. Furthermore, by (ii) and corresponding to the k = m case, we 
have that 


(2.141) Am+i(m +1) = f(Am(m)) = f(Am+i(m)). 
Thus, (2.140) is true for 1 < k < (m+1)-—1. By this and (2.139), the proof of the 
inductive step is complete. By induction, we conclude that S = Zr. 
Now, since S = Z*, we can define the function A: Zt > R by 
(2.142) A(n) := An(n) for ne Zt. 
This function satisfies (exercise!) 
(a) A(1) = a1, 
(b) A(k +1) = f(A(k)) for all k € Z*. 
We have proved the existence of A. 
Remark. We actually have Aly, = An (restriction) for n € Z* (see 45.3.5.2] 
below for the definition of the restriction of a function to a subset of its domain). 
Uniqueness of A. Suppose that we have a second function A’ : Zt 4 X 
satisfying A’(1) = a, and A’(k+1) = f(A’(k)) for all k € Zt. 
We will now prove by induction that A(n) = A’(n) for all n € Zt. 
Base case: n = 1. By assumption, A(1) = a; = A’(1). 
Inductive step. Suppose k € Zt is such that A(k) = A’(k). Since f is a 
function and by our hypotheses, we have 


(2.143) A(k +1) = f(A(k)) = f(A’(k)) = A(R +1). 


By induction, we are done. This proves the uniqueness of A. 
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A slightly more general form of recursive definition is as follows. We don’t 
bother to prove this theorem. The interested readers are very welcome to prove it 
for themselves. 


Theorem 2.28. Let X be a set, let ay € X, and let f : X x Z2 + X be a function. 
Then there exists a unique function A: Z2 — X satisfying 


(1) A(0) = ao, 
(2) A(k +1) := f(A(k),k) fork € Z2. 
Example 2.29. Recall that the of a non-negative integer is defined by 
(1) O! :=1, 
(2) n}=1-2-3---(n—1)-n forneZ. 
So we see the recursion relation 
(2.144) (kK +1)! = (k+1)-k!. 
In terms of the general recursive definition in Theorem [2.28] we are taking 
X =Zt, aj :=1, and 
(2.145) f(m,k) := (k+1)-m. 
So A(0) = 1, and 
A(k +1) = f(A(k),k) = (K+ 1) - A(z) 
for k € Z2. We see that 
A(k) = kl. 
Another generalization of Theorem [2.24] is as follows (again, we don’t prove 
this, although the proof is not difficult). 


Theorem 2.30. Let X be a set and let f: X x X + X be a function. Then, for 
every element a1,a2 € X, there exists a unique function A: Z* — X satisfying 


(1) A(1) =a, A(2) = ao, 
(2) A(k+1) = f(A(k — 1), A(k)) for integers k > 2. 
An example of this is the Fibonacci sequence { f,, }°°., discussed in §2:5] where 

we take 
(2.146) f(t,y):=a+y 
in Theorem Indeed, in this case we also take a, = aj = 1, and we compute 
that 
A(3) = f(A(), A(2)) = AQ) + AQ) =141=2, 
A(4) = A(2)+ A(3) =14+2=3, 
etc. We leave it to the reader to verify that f, = A(n) for all n € Z*. 


Of course, we can have more general versions of recursive definitions. The 
discussion above is just a sample. 


(2.147) 


Exercise 2.45. Define inductively the sequence of numbers {gn}°5 by go = 1 and 
(2.148) Qn = 29n-1 +2” for n>1. 
Prove that gn = (n+1)2” for alln > 0. 
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2.7. Arithmetic and algebraic equalities and inequalities 


We can prove some easy but fundamental algebraic identities using induction. We 
start with something more fun. 


2.7.1. A philatelic puzzle. 


Exercise 2.46. Prove that for every integer n > 12 there exist non-negative inte- 
gers a and b such that 


(2.149) n = 4a + 5b. 
A philatelic way to describe this is that any postage amount which is at least 12 cents 
(oh, remember the good old days!) can be made from 4-cent and 5-cent stamps. 


Hint: A case pertinent to the idea of the proof of the inductive step isn = 15. 
Also, split up the proof into two cases, where one of the cases is related to the 
aforestated hint. 


2.7.2. Induction and even and odd. We consider some results regarding parity 
that can be proved by induction. 


Solved Problem 2.31. Prove by induction that the sum of any n even integers is 
even for alln € Zt. 


Solution. The base case is obvious since the sum of one integer is itself, so we 
just prove the inductive step. Suppose that k € Z* has the property that the sum 
of any & even integers is even. Let a1, da2,...,@%41 be even integers. Then 


ay +g ++++ + Ge41 = (1 +2 +--+ + ae) + Ge41- 


Now both a; + a2 +--:+ ax and ag+1 are even, so their sum is even (by Theorem 
(1.24). This proves the inductive step. 


By induction we are done. 


Exercise 2.47. Prove that for every positive integer n, if a is even, then a” is even. 
Note that this exercise generalizes Exercise Did you need to use induction? 


Exercise 2.48. Prove by induction on n that for every positive integer n, the 
product of any n odd integers is odd. 


Exercise 2.49. Prove that for every positive integer n, if the product of n integers 
is even, then at least one of the integers is even. This exercise generalizes Exercise 


115 

2.7.3. Induction and the distributive law. In this subsection we consider more 
general forms of the distributive law. 
Exercise 2.50. Prove that if k € Z* and if b,a,,a2,...,a% are real numbers, then 
(2.150) (a1 tag +++: +ap)b = a,b + agb+---+axb. 


For this exercise you may assume right distributivity for the “field” of real numbers 
with addition and multiplication (see {8.7])). See Figure for a visualization of 


this formula. 
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ay a2 a3 Qk-1 ak 


Figure 2.7.1. A visual proof that (a1 +a2+---+a,)b = a1b+a2b+---+axb. 
We show this proof since it is helpful for understanding different points of view. 


Exercise 2.51. Prove that ifk € Z* and if a1, a2,...,ax,61,b2 are real numbers, 
then 


k k k 
(2.151) (>: «) : (by + bz) = a,b, + > a;ba. 
w=1 t=1 i=1 


See Figure [2.7.2] for a visualization of this formula. 


be 


by 


ay a2 a3 Qk-1 ak 


Figure 2.7.2. A visual proof of (2.151). 


We have the following formula for the product of two sums. 


Exercise 2.52. Prove that if k,€ € Z* and if a1,a2,...,ax,61,b2,...,b¢ are real 
numbers, then 


k £ £ k £ ok 
(2.152) (>: «) . S57; as S- (doa) — S- Gide, 


j=l \i=l j=l i=1 
For example, if k = € = 2, then this says that 
(2.153) (ay + az) (by + bz) = (ayby + aby) + (ayb2 + ab). 


Hint: You may for example prove this by induction on é. 


The formula above generalizes to products of a finite collection of sums. 


Exercise 2.53. Without going through all the details, describe how you might go 
about proving the following by induction: Ifr € Z*, ki,ko,...ky € Z*, anda ER, 
where l<i<rand1<j< kj, then 


é ko ky 


ki ky 
(2.154) TE | So ait | = 0 SS YE aajaays sary, 


t=1 \Fi=1 Jr=1 Jo=1ji=1 
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For example, if r= 3 and ky = kz = k3 = 2, then this says that 


(G11 + @12)(d21 + a22)(a31 + a32) 
2 


2 2 
= s s 14,425 43k 
1 1 


k=1 j=1 i= 
2 2 2 2 

=) s a14425431 + > s Q1; 0275432 
j=l i=l j=l i=l 


I 


2 2 2 2 
s Q4;421431 + S a1,422431 | + s Q14421432 + s 414422032 
i=1 i=l i=l i=l 


Q11421031 + @12021431 + 11422031 + 412022431 


l 


+ G11421032 + 412421432 + A11422432 + 412422432. 


2.7.4. Induction and division. Here we consider powers and division. The 
following result may be interpreted as saying that the integers (5 + 1)” and 1” 
have the same remainder 1 when divided by 5. For more general results regarding 
division, see Chapter [4] 


Exercise 2.54. Prove that for alln € Z*, 6"—1 is divisible by 5. See Figure 22.7.3] 
for a visualization of the inductive step. 


6” — 1 1 


6” 


Figure 2.7.3. Visualizing the inductive step to prove Exercise[2.54|by consid- 
ering the total area of the shaded region as the sum of the areas of the yellow 
and green rectangles. Note that the area of the big rectangle is 6” -6 = 6"+1, 
whereas the area of the small white square is 1. 


Exercise 2.55. Generalize the previous exercise as follows. Let k be a positive 
integer. Prove that for alln € Z*, (k +1)" —1 is divisible by k. 


2.7.5. Induction and algebraic inequalities. The following result, proved by 
induction, alternatively follows from the fact that for a finite set of real numbers, 
their minimum is at most their average, and their maximum is at least their average. 
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Solved Problem 2.32. Let n be a positive integer. Prove that for every real 
number b, a1, 42,.--,;@y, such that 


(2.155) a1 +ag +--+ +an =, 
there exists 1 <i <n such that a; < 6 and there exists 1 < 7 <n such that a; => 6 


Solution. We prove this by contradiction. Suppose there exist n € Zt and 
real numbers 0), a1, @2,...,@n, such that 


(2.156) a1 +ag +--+ +an =, 
but there does exist 1 <i < n such that a; < o. Then we have a; > o for all 


1<i<n. This implies that 
(2.157) b=a,+ag+---+a ee a 
: — 41 2 a Ln nm 


which is a contradiction. 
This proves the first statement of the problem. The second statement is proved 
similarly. 


An alternative proof follows from the existence of a minimum element. Namely, 
by Theorem [Z.46] below, there exists 1 < i <n such that a; <a; for alll <j <n. 
We then have that na; < pam) a; = 0. 


Remark 2.33. In the proof, we used the following fact. If n € Z* and if a; > 0; 
for 1 <i<n, then 


(2.158) Sage 


This fact we can be proved, yes, by induction! (We leave this as an exercise.) 
Exercise 2.56. Prove the fact stated in Remark [2.33] 


Exercise 2.57. Prove that if aj, a2,...,@n and bi, b2,...,bn are non-negative real 
numbers satisfying a; < b; for alll <i<n, then 


(2.159) a1a2°':An < by b2--- by. 


Hint: Besides the base case n = 1, you may wish to prove the n = 2 case, which 
may then be used for the inductive step. 

The following may be relevant to the next exercise: Is an analogous statement 
true where we replace weak inequalities < by strong inequalities <? 


Exercise 2.58. Suppose that n = a,a2:--axz, where all of the letters are positive 
integers. Prove that there exists 1 <i<k such that 


(2.160) ay < Wn. 
Hint: Prove this by contradiction. See Figure for a visualization when k = 2. 
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ay 


4/2 1a2 a2 
4/142 


Figure 2.7.4. An impossible picture. Visualizing the k = 2 case of a proof by 
contradiction of (2-160). If a1,a2 > ,/aiaz, can the area of the purple square 
with side lengths \/ai1az2 equal the area of the orange a1-by-a2 rectangle? Do 
we need the a,’s to be integers, or can they be any positive real numbers? 


2.7.6. Thinking about proving inequalities by induction. clco thru -part2 
For some of the proofs by induction of inequalities we encountered earlier in this 
chapter, we proved a monotonicity property, which can be stated as follows. 


Exercise 2.59. Let {fn}, and {gn}, be sequences of positive real numbers. 
Suppose that f; > g and that for allk > 1, 


frti < Gk+1 
2.161 —_— : 
( ) fe ~~ Ok 
Prove by induction that for alln € Z*, 
(2.162) fn =9n, that is, dn > 1. 

Gn 

Remark 2.34. Observe that (2.167) is equivalent to 
(2.163) Inti Sn 

Gn4+1 Jn 


From this we see that 
(2.164) ch che hehe... 
91 92 93 94 
This explains why the exercise is true, although the formal proof of it is by induc- 
tion! 


Note that we can interpret (2.161) as saying that, term-wise, the sequence fy, 
increases faster than gz. 


2.8. Hints and partial solutions for the exercises 


Hint for Exercise As the hint says, the answer is given in 42.1.2] 


Hint for Exercise For the inductive step, we need to show that if k is a 
positive integer with the property that the formula 1+3+5+---+(2k—1) =k? 
is true, then the formula 1+3+5+---+(2(k+1)—1) = (k+1)? is true. To 
prove this implication, use the inductive hypothesis and the easy fact that 
14+3454+---+ (2(k+1)—-1) = (14+3454+---+(2k-1)) + (2(k +1) - 1). 


Simplify the resulting quadratic expression in k. 
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Hint for Exercise For the inductive step: Suppose that k is a positive 
integer such that >. (b+ ia) =k (44a +6) is true. Show that ean: +ia) = 


(k +1) (Ao + 5) is true. To do this, relate Reais +ia) to 0 4 (b + ia). 


Hint for Exercise The sum of the powers of r from 0 to k + 1 is equal to 
the sum of the powers of r from 0 to k plus r**+!. This will help with the inductive 
step. 

Hint for Exercise Use Exercise [2.4] with r = 1/2. Then take the limit of 
your formula for the partial sum s, as k > co. 


Hint for Exercise The inductive hypothesis is that k is a positive integer 


with the property that ee ?= meer) is true. 


Hint for Exercise The inductive hypothesis is that k is a positive integer 


with the property that 17+ 3?+---+ (2k-—1)? = ROR NET) is true. 


Hint for Exercise The easy fact et) (-1)' 2 = *_(-1)' 2 + (-1)* F? 
is useful for the ater step. You may need to factor a quadratic polynomial in 
k. 


Hint for Exercise For the inductive step, the quartic polynomial you will 
encounter has factors of k + 1 and k + 2 (to what power(s)?). 


Hint for Exercise You can prove this directly by induction, or use the 
previous results for summing i and 7?. 


Hint for Exercise For the inductive step, as is often an algebraic step for 
such problems, put ratios over a common denominator. 


Hint for Exercise This is similar in spirit to the previous exercise. 
Hint for Exercise [2 For the inductive step, showing that 


( Geet) O- Siay 7 


Hint for pre ious For the inductive step, you may come across the expres- 
sion (1 - wan) ait . Simplify this. 


will be helpful. 


Hint for Exercise Although P(k) = P(k+1) is true for all k > 2, it is not 
true for k = 1. 


Hint for Exercise |2.16} The problem with this “proof by induction” is in the 
inductive step. The{converse]of an implication is not “logically equivalent” to the 
implication. 


Hint for Exercise This is the same falsidical paradox as the “all horses are 
the same color” falsidical paradox. 


Hint for Exercise Apply the inductive hypothesis to 4*+! = 4.4". Then 
show that the inductive conclusion will follow from showing for k > 4 that Ak? > 
(k+1)?. By taking square roots, show that this is true whenever k > 1, which is 
certainly true if k > 4. 
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Hint for Exercise |2.19} The inductive hypothesis is: Suppose that a positive 
integer & has the property that for every line of k pandas and penguins, where the 
first animal in the line is a panda and the last animal in the line is a penguin, then 
somewhere in the line there is a panda directly in front of a penguin. 

To prove the inductive conclusion, we assume the following: We have a line of 
k +1 pandas and penguins, where the first animal in the line is a panda and the 
last animal in the line is a penguin. 

Now consider the first k animals and consider two cases: (1) The k-th animal 
in the line is a panda. (2) The k-th animal in the line is a penguin. 


Hint for Exercise Let us take J = 7 and B = 23 for example. (But you 
should choose different numbers.) Our first job is the find the least positive integer 
L such that: For each integer n > L, 


(2.165) > aw. 


Show that this is false for n = 2 and true for n = 3. Let n = 3 be the base case. The 
proof is similar to the previous exercise, that is, the same general scheme, except 
with different numbers. So mimic the previous exercise. 


Hint for Exercise By now, you should have had a fair bit of practice proving 
formulas by induction. For this exercise, you may find the elementary exponential 
function fact that 2-2*+! = 2*+? to be useful. 


Hint for Exercise Both a” and n! are products of n positive integers. In 
the former case all the n numbers are a, whereas in the latter case the numbers are 
the integers from 1 to n. Now suppose that n is much larger than a. We have 


a-—1 n n 
(2.166) n= ]Ji][i=@-!] fi 
i=l i=a i=a 


As n gets larger, we have more and more integers 7 between a and n where i > a 
(informally, > means much larger). 


We have 
(2.167) nl (a—1)! | are: 


an qe-1 qn-atl* 
The first factor is a positive real number independent of n, but it is rather clear 
that the second factor limits to co as n + oo. This is an intuitive explanation for 


! 
why = — 00 as n > oo. 


Suppose that k > no has the property that (2k)! > a*. Use that (2(k + 1))! = 
(2k + 2)(2k + 1) - (2k)! to prove the inductive conclusion. 


Hint for Exercise |2.23} For the inductive step, you may find the elementary 
identity (k + 2)-(k +1)! = (K+ 2)! to be useful. 


Hint for Exercise The inductive step boils down to proving the inequality 
Vk + Jet > Vk +1. To see this, multiply everything by Vk + 1, and simplify. 


The easier way (perhaps less rigorous) to solve the exercise is to observe that 


I> a for 1 <i <n and that there are n terms in the sum. 


vi 
Hint for Exercise Use (2.2). 


102 2. Mathematical Induction 


Hint for Exercise This exercise is rigged to work since 
i(k) + (F(R +1) — f(®)) = F(R +1). 


Hint for Exercise For this exercise, assume Lemma below. This 
makes the cases n = 1 and n = 2 easy. The inductive hypothesis is: Suppose k 
is a positive integer with the property that if a,b), be,...,b,% are non-zero integers 
such that gcd(a,b;) = 1 for 1 < i < k, then gcd(a, bib2--- bg) = 1. To prove the 
inductive conclusion, use this, the fact that by ---bg41 = (b1--- be) + be41, and the 
n = 2 case (Lemma [4.29). 


Hint for Exercise For calculating t3, if the rightmost tile is vertical, then 
by considering the two leftmost tiles, we see that there are tg = 2 such tilings. If 
the (two) rightmost tiles are horizontal, then there is just t; = 1 leftmost (vertical) 
tile. 


A similar description may be made for calculating tg. 


Hint for Exercise If the rightmost tile is vertical, then by considering the 
four leftmost tiles, we see that there are ty = 5 such tilings. If the (two) rightmost 
tiles are horizontal, then by considering the three leftmost tiles, we see that there 
are tz = 3 such tilings. 


Hint for Exercise We guess that ty, = tn—2 + tn-1. 


Hint for Exercise [2.31} You will find the easy identity f; + fs + fs +---+ 
focetay—1 = (fi + fa + fs +++++ fox—1) + fort useful. 


Hint for Exercise This exercise is very similar to the previous exercise. 


Hint for Exercise Mimic the solution to Solved Problem 2.20] You'll find 
that the calculations are slightly shorter. 


Hint for Exercise Again mimic the solution to Solved Problem [2.20] The 
calculations are slightly longer, but still quite manageable. 


Hint for Exercise [2.35} The base case £ = 1 is easily seen to be true. As our 
inductive hypothesis, assume that m is a positive integer with the property that 
frm is divisible by f,. To prove the inductive conclusion, show that fx(m41) = 
fem—1fk + femfe+1 and use the inductive hypothesis. 


Hint for Exercise Use the inductive hypothesis and the Fibonacci recursive 
definition to prove the inductive conclusion. 


Hint for Exercise We prove identity (2.108) by strong induction on n. 
The base case: n = 1 follows from the Fibonacci recursive definition. To prove the 
n = 2 case use the Fibonacci recursive definition and that fg = 1 and f3 = 2. 

For the inductive step: Suppose that identity (2.108) is true for 1 <n < k. 
Start with fm+ ati) = fm+k—-1+ fm+r, and use that (2.108) is true for n = k —1 
and n =k. 


Hint for Exercise Use (2.129) (twice) to rewrite f? + f?,,, and then use 
Exercise 2.37] 
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Hint for Exercise As our strong inductive hypothesis, assume that k > 2 
is an integer such that (2.113) is true for 1 <n < k. Using this strong inductive 
hypothesis, compute f_(x41)- 


Hint for Exercise As our inductive hypothesis, assume that k is a positive 
integer satisfying 


(2.168) fn <2” for l<n<k. 
Estimate fx41. 


Hint for Exercise For the strong inductive hypothesis: Suppose that the 
result is true for all positive integers up to and including n — 1, where n > 2. Let 
f be the largest Fibonacci number less than n. Clearly, 


(2.169) n=f+(n—f), 


where n — f € Z*. Show by contradiction that f > n— f. Then apply the strong 
inductive hypothesis to prove the strong inductive conclusion. 


Hint for Exercise Assume the inductive hypothesis that k is a positive 
integer with the property that 


G. fk aes el 
fe. tea-Jo hg * 


Hint for Exercise Use frsife—1 — ff = det frst de and the 
“ feo fe-1 


previous exercise. You will need to know some basic facts about determinants of 
2 x 2 matrices. 


Hint for Exercise As our inductive hypothesis, suppose that k € Zt 
satisfies ie fifisa = f$,. Using this and the Fibonacci recursive definition, 


compute eas fifiqi- 


Hint for Exercise [2.45} The inductive step follows from a straightforward cal- 
culation. 


Hint for Exercise The base case is n = 12. As the inductive hypothesis, 
suppose k > 12 is an integer for which there exist a,b € Z2 such that k = 4a + 5b. 
Split the proof into two cases: (1) a> 1. (2) a=0. 

How does the assumption that a > 1 help? 

When a = 0, we have k = 5b. Explain why b > 3. Observe that if you replace 
three 5-cent stamps by four 4-cent stamps, the postage increases by 1 cent. How 
does this help? 


Hint for Exercise No need to use induction. Let n be a positive integer. 
Use that a” =a-a"~! and that an even integer times an integer is even. 


Hint for Exercise Use induction, the identity a,aq +--+ @p41 = (a1a2-+-+ az): 
Gz41, and the fact that the product of two odd integers is odd. 
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Hint for Exercise Take the contrapositive of the implication comprising 
the statement of the previous exercise. 


Hint for Exercise As the inductive hypothesis, suppose that n is a positive 


integer such that for every real number 6, a1, a2,...,@n, we have 
(2.170) (ay tag +--+ +4n)b = a1b + agb+-++ + and. 
Now suppose 0, a1, @2,..-,@n41 are real numbers. Compute (a1 + a2 +--+: +4n41)), 


while using the usual distributive law and the inductive hypothesis. 


Hint for Exercise For the inductive step, we assume that k is a positive 
integer with the property that for every real number aj, d2,...,a%, 01, b2 we have 


k k k 
(2.171) (>: «) . (by + be) = S- ajo, + S- a;bo. 
i=1 i=1 i=1 


Now let a1, @2,...,@k+41, 61, b2 be real numbers. Compute ee ai) - (by + bg). 


Hint for Exercise We prove the statement by induction on @. The @= 1 
case follows from the previous exercise. Suppose that m is a positive integer with 


the property that for every real number aj, a2,..., dx, 01, b2,..., bm we have 
k m m k 
i=l j=l j=1 \i=1 
Now let a1, @2,...,@%, 01, b2,...,bm41 be real numbers. Compute So a; aes b; 


while using the previous exercise and the inductive hypothesis. 


Hint for Exercise We would prove this result by induction on r while using 
the result of Exercise [2.50) 


Hint for Exercise[2.54} For the inductive step, assume that k is a positive integer 
with the property that 6* — 1 is divisible by 5. By definition, this means that there 
exists an integer a such that 6* — 1 = 5a. Show that 6*+! — 1 is divisible by 5. 


Hint for Exercise Use the same method as the previous exercise. The only 
difference is that this exercise is slightly more abstract. 


Hint for Exercise Use that 
n+1 


n 
; ay = ) aj; + An+1- 
i=l i=l 


Also use the general fact that ifa > b and c>d,thena+c> 6+d. All of this will 
help you with the inductive step. 


Hint for Exercise Suppose that & is a positive integer with the property 
that if a1, a@2,...,a% and by, bo,..., bg are non-negative real numbers satisfying a; < 
6; for alll <i<k, then 


(2.173) a1a2°:':QAk < by bo +--+ bp. 


(That is, assume that the implication above is true for the positive integer k.) Now 
suppose that a,,d2,...,@,4, and b,,b2,...,b,41 are non-negative real numbers 
satisfying a; < b; for all 1 <i<k-+1. Continue (what do we want to prove?). 
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Hint for Exercise |2.58} This is not an induction proof per se. Since the set 
{a1,@,...,a%} is a finite set, it has a minimum element (this fact can be proved 
by induction and is discussed in a later chapter). How does a,a2---a, compare to 
the k-th power of this minimum element? 

Hint for Exercise Since all of the numbers are positive, inequality 
is equivalent to dota o ae By hypothesis, we have ft > 1. As the inductive 
hypothesis, suppose that & is a positive integer satisfying : > 1. Continue. 


Chapter 3 


Logic: Implications, 
Contrapositives, 
Contradictions, 

and Quantifiers 


Figure 3.0.1. [René Descartes] (1596-1650). Portrait by Frans Hals in Louvre 


Museum. Wikimedia Commons, Public Domain. 


Goals of this chapter: To get down to the business of proving theorems using 
logical reasoning. To give some fun examples of applying logic to solve riddles, which 
are |toy models) of theorems. To understand the basics of logic. To see proofs by 
contradiction, including the incredible fact that the square root of two is irrational. 
To discuss the key concepts of universal and existential quantifiers. 


Cogito, ergo sum.| — René Descartes 


Question: Does the rock Descartes is sitting on exist? 
Answer: We know it exists, but the rock doesn’t. 


Rebuttal: Those who believe in the doubt this. 
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We begin this chapter with some wrong proofs and some fun riddles and puzzles. 


The more rigorously minded reader may proceed directly to 43.5] (perhaps after 
skimming through the first few sections). 


3.1. The need for rigor 


Mathematics is based on logical deduction. In this book so far we have focused 
on presenting proofs with correct logical deduction. To expand the landscape of 
results we can prove, it is useful to lay down some more foundation for the logic we 
use to rigorously prove statements, which we do in this chapter. We have also seen 
that visual proofs are very effective and convincing, and so far we have presented 
correct visual proofs. But we present to you: 


3.1.1. A cautionary tale: Not all “ducks” are ducks. 


If it looks like a duck, swims like a duck, and quacks like a duck, 
then it probably is a duck. See Figure [3.1.1] 


We often use this type of reasoning in everyday life. And when we use this type 
of reasoning, we are usually correct! We may think of “superficial” as being what’s 
on the surface, and “deep” as being what’s beneath the surface. Sometimes what is 
deep agrees with what is superficial, but other times they are different. This is when 
we have to be careful with our reasoning. In the development of mathematics, by 
trial and error, humans have decided on how deeply and carefully we should think. 
So this decision of “rigor” is a social phenomenon. We will be conventional and 
present mathematics roughly from this point of view, which actually agrees with 
our personal viewpoint. 


4645556 


Figure 3.1.1. They look, swim, and quack like ducks. Are they ducks? Figure 
created using the LaTeX TikZducks package. 


If true, the following superficial and false result might cause the universe to 
spontaneously combust (just kidding). 
Non-Theorem 3.1. 
63 65 
a 
That is, by subtracting 8 from both sides, we get0O=1. © 


(3.1) 


Proof, not. The theorem follows immediately from the two claims: 


Claim 1. The union of the red, orange, yellow, and green regions is a right 
triangle with non-hypotenuse side lengths 5 and 13. Thus the sum of their areas is 
equal to $-5-13 = 8. 
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Figure 3.1.2. Two “triangles” that pass the duck test 


Claim 2. The union of the red, orange, yellow, and green regions is a right 
triangle with non-hypotenuse side lengths 5 and 13 minus a unit square ©. Thus 


the sum of their areas is equal to a -l= 83 


Where did we go wrong? 


Well, both regions in Figure [3.1.2]look like triangles. But, as your mother told 
you, looks can be deceiving! 


Let us look at the “triangle” in Claim 1, which is the top “triangle” pictured 
in Figure[3.1.2] The red triangle has non-hypotenuse side lengths 5 and 2, whereas 
the green triangle has non-hypotenuse side lengths 8 and 3. But for the “whole 
triangle” to really be a triangle, we need these two triangles to be similar. But 
they are not as 5/2 is not equal to 8/3. 

In fact, the true sum of the areas of the colored triangles is equal to 

863 bad 64 
(3.2) gg Tea ee tal 


which happens to be the average of © and ©. So neither Claim 1 nor Claim 2 
gives the correct area! 
3.1.2. More fallacies: Let us make up some axioms. 


Seen on a car bumper sticker: My karma ran over your dogma. 
Seen on a dog leash: My dogma ran under your karma. 


Besides wrong inductive or visual reasoning, we are attracted to wrong algebra 
rules because of their simplicity and beauty. For example, take the following. 


110 3. Logic: Implications, Contrapositives, Contradictions, and Quantifiers 


Non-Theorem 3.2. 


(3.3) 1+1= 
Non-Proof. We compute 

1 i@gisl 2. 
(3.4) 1+1 ae ca FS 1. 


Of course, the result above is false. © For its “proof” we only made one algebra 
mistake. So we see that it is important to know which rules are true and which 
rules are false. 


Solved Problem 3.3. Consider the fake formula 
a ec a+c 
mn bt ad b+d 


Prove or disprove each of the following statements: 


(1) For all positive real numbers a,b,c, d, equation (8.5) is true. 


(2) There exist positive real numbers a,b,c, d for which equation (8.5) is true. 


Solution. (1) For statement (1) to be true, (8.5) must be true for all positive 
real numbers a,b,c,d. So if there exist positive real numbers a,b,c, d for which it 
is false, then the statement is false. That is, to prove that the statement is false, 
we just need to come up with one example of a, b,c, d for which is false. This 
is easy: let a = b =c=d=1. In this case, equation says that (déja vu) 

1 1@14+1 2 
1 1 1+1 2 
which is false. We conclude that statement (1) is false. 


>) 


(2) To prove that statement (2) is true, we would just need to find one example 
of a,b,c, d for which is true. Unfortunately, we will show that there is no 
such example, and therefore statement (2) is also false. We can see this as follows. 
Equation is equivalent to the equation obtained by multiplying it by b+ d: 


(3.6) a(1+$)+e(1+5) -ate 


This in turn is equivalent to 
ad ‘ ch 
b d 


Since a, b,c, d are all positive, there are no solutions to this last equation. Therefore, 
there are no positive solutions to (8.5). We conclude that statement (2) is false. 


Exercise 3.1. Prove or disprove each of the following statements (about| elementary 
that start with the following: 


For all real numbers x and y: 
(1) (e@+y/=274+y?. 
(2) ett¥ = et + €¥. 


(3) In(a + y) = In(x) + In(y), where we assume in addition that x,y > 0. 


(3.7) 0. 


3.1. The need for rigor 111 


Hints (for this exercise and the next): Consider the quadratic case of the|Binomial 
and the identities e*T¥Y = e*e¥ and In(xy) = In(x) + In(y). 


Can rules that are not always true be sometimes true? The following shows 
that it depends on the types of functions we are considering. 


Exercise 3.2. Prove or disprove each of the following statements that start with 
the following: 

There exist positive real numbers x and y such that: 

(1) (a@+y) =a? 4+y?. 

(2) ett¥ = e* + €¥. 


(3) In(a + y) = In(z) + In(y). 


3.1.3. Beauty and the beast: Mistakenly assuming linearity. Sometimes we 
think too fast by assuming a beautiful “formula”. One way of slowing down our 
thinking is to check our “facts” against “reality”. Knowing when to slow down our 
thinking comes from experience. If one is cautious, one checks one’s calculations 
more. 


The beast, a.k.a. the so-called |freshman’s dream, is 


© n 
(3.8) (e@+y)"=a™+y", 
where n > 1 is an integer. 


For example, the n = 2 case of the freshman’s dream is 


(3.9) (x+y)? Sa? +47. 


To check this dream against reality, we may consider various simple cases and see if 
they are formulas we know the truth value of. For example, if we take x arbitrary 
and y = 0, then we obtain the formula 


(3.10) (2 +0)? = 2? + 0?, 
which is true since both sides are equal to x”. However, if on the other hand we 


take « = y = 1, then we get the non-sensical (false) formula 
(3.11) (iyi +1, 


that is, 4 = 2. So we conclude that the freshman’s dream is only a false dream! 
This disproves the statement in Exercise [3.1[1). 


Takeaway: The idea of checking a conjecture against special cases of the 
conjecture is very useful. 


A similar misconception is: For all non-negative real numbers x and y we have 
© = 
(3:12) Vaty=Vrt+ vy. 
Exercise 3.3. Can you disprove formula (8.12) ? 


Exercise 3.4. In view of these examples, can you explain the title of this subsection 
about “mistakenly assuming linearity” in terms of properties of functions? 
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3.1.4. A happy tale: The most famous theorem of plane geometry. Since 
we have seen how visual proofs can go wrong, to restore our faith in them, we 
consider one of the many geometric proofs of the Pythagorean theorem. Visual 
proofs of the Pythagorean theorem can be made formally rigorous. However, since 
they are “clearly” correct, we will not present the details of how to make them 
formally rigorous. 

Consider a right triangle with side lengths a,b,c, where c is the length of the 
hypotenuse. 


Figure 3.1.3. A right triangle. 


Suppose we are asked to prove the [Pythagorean theorem} 

(3.13) a? +b? =". 

A picture that may come to mind is the one in Figure B.1.4] Here, we may think 
of our original triangle as a room. One of the proof ideas in geometry is to build 
a house around the room. Unfortunately, the house built in Figure [3.1.4] does not 
immediately suggest a proof, for why should the areas of the two smaller squares 
sum to the area of the bigger square? 

Here is where we need to use our creativity. As an given our tri- 
angular room, we can create any auxiliary structures attached to it that we like. 
In particular, if we mash up two of our right triangles together to form an a x b 
rectangle, it is not hard to imagine finally arriving at the diagram on the left of 
Figure B.L.5L)] The diagram on the right of Figure B.L5lis a square house of the 
same dimensions (a+ b) x (a+b). The Pythagorean theorem follows from the fact 
that the two houses have the same area. Here is the algebra: 


(3.14) Area of House One = a? + b? + 2ab, 


(3.15) Area of House Two = c? + 2ab. 


The Pythagorean theorem follows (just subtract 2ab “square feet” from both houses)! 


In “The Matrix” movie series, the Architect is the “father” of The Matrix. 
Likewise, we are the “parents” of the proofs we create. 


Theorem 3.4 (Pythagorean theorem). If a right triangle has hypotenuse of length 
c and the other side lengths are a and b, then 


(3.16) a? +b? = c*. 


1Mashup of triangles is not to be confused with |mashup| of songs! 
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Figure 3.1.4. The right triangle and the squares of its sides. 


Proof. Figure[B.1.5]is an illustration of a proof of the Pythagorean theorem. There 
are in fact many different proofs of the Pythagorean theorem; a google search will 
yield many examples. 


Figure 3.1.5. A visual proof of the Pythagorean theorem. |The house that 
Jack built is on the left. The house that Jill built is on the right. 


Exercise 3.5. In your own words, write out an explanation of a proof of the 
Pythagorean theorem based on Figure [3.1.5} 


3.2. Statements 


Although visual proofs are appealing and we often use geometric constructs to 
understand logical reasoning, it is important to learn how to write mathematical 
proofs. Before we learn how to write proofs, we need to learn how to correctly 
write the statements we are proving and/or learn how to understand and interpret 
statements. For example, we wish to shy away from ambiguous statements and 
questions. 

Here are some examples of ambiguous sentences to put in a recommendation 
letter: 


e A person like them is hard to find. 
e They are an unbelievable worker. 
e You will be lucky to get them to work for you. 


e Their true ability is deceiving. 
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e We cannot recommend them too highly. 


e You should waste no time in making them an offer. 


Just as in any writing, bad mathematical writing can be ambiguous: 


Question 3.5. If The Hulk runs around the lake twice and Captain America| runs 


around the lake 5 times more, how many times does Captain America run around 
the lake? 


Answer 1. 7, because 2+ 5 = 7. 
Answer 2. 12, because 2+5-2= 12. 


Which answer is correct? It depends on how one interprets the word “times” 
in the question. If each time is one time around the lake, Answer 1 is correct. If 
we mean 5 times the number of laps The Hulk has run, then Answer 2 is correct. 


Admittedly, this example of a communication breakdown was a|\cheap trick) taking 
advantage of |linguistic ambiguity! But, in general, we should be careful to make 


precise mathematical statements. 


Now, what is a statement? In short, it is a sentence that is either true or false 
(but of course not both). For example, 


“One and one and one is three’ 
is a true statement, provided we interpret it as saying that 1+1+1=<83. Similarly, 
“Two plus two is four. Minus one, that’s three, quick maths’|}| 
is also a true statement, at least in that 2+ 2 = 4 and 4—1= 3. Whether this is 
quick maths may be a matter of opinion! 
Some sentences cannot be either true or false, and hence these sentences are 


not statements. Consider a version of the 


“This sentence is false.” 


Is this sentence a statement? That is, is it either true or false? Let’s see: Suppose 
that the sentence is true. Then, by its very content, the sentence is false! On the 
other hand, suppose that the sentence is false. Then, it says that the sentence is 
not false; that is, the sentence is true. So, if the sentence is a statement, then we 
obtain a contradiction. We conclude that the sentence is not a statement! 


Other versions of the liar paradox are: “I am lying” and “This sentence is a 
lie.” Observe that these sentences are self-referential, a concept which occurs in 
language, logic, mathematics, and computing. 


Remark 3.6. A formal way to present the liar paradox is as follows. Given a 
statement P, let TV(P) denote its truth value. That is, TV(P) = T if P is true, 
and TV(P) = F if P is false. Then the form of the liar paradox sentence P is 


(3.17) P =(TV(P) =F). 


So, if TV(P) = T, then TV(P) = F. And, if TV(P) = F, then TV(P) = T. Each 
case leads to a contradiction. 


?From the song “Come Together’ by The Beatles. 
3From the song ‘Man’s Not Hot’ by Big Shaq. 
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Two professors of logic are in a classroom and they notice the 
statement “Only a doofus would believe a statement like this!” 
written on the blackboard. Puzzled, one of the professors asks 
the other, “Do you believe this statement?” Dismayed, the other 
replies, “Of course not! Only a doofus would believe a statement 
like this!” 


3.3. Truth teller and liar riddle: Asking the right question 


Mathematics is about asking and answering questions. We have seen the dangers 
of making statements or asking questions that are ambiguous, oxymoronic, or self- 
referential. Given a well-posed question, how do we answer it? Sometimes, it is by 
asking the right question in return! 

Abbott and Costello have asked the question: 


We instead ask the question: 
Who is telling the truth? 
is at the officer’s club in Cairo after a long walk in the 


desert. Two officers, named Beyoncé and Jay-Z, play a trick on him by hiding two 
glasses side by side behind the counter, one filled with hot coffee and the other 
filled with ice-cold One of the officers always tells the truth and the 
other officer always lies, but Lawrence does not know who is who. Lawrence, who 
is in desperate need of a lemonade, is allowed only one question. What question 
should poor Lawrence ask, and how shall he subsequently choose? 


The answer is that Lawrence should ask the question: 


Beyoncé or Jay-Z, which glass (left or right) will the other of you two tell me 
is the glass with the lemonade? 


Let us analyze this logically to determine how, given an answer, Lawrence 
should choose. Without loss of generality, we may assume that Beyoncé answers 
the question and that she says that Jay-Z will say that the lemonade glass is on 
the right. 


Case 1: Beyoncé is the truth teller. Then Jay-Z is the liar, and so since 
Beyoncé is telling the truth, Jay-Z will say that the lemonade glass is on the right 
and he will be lying. So we conclude that the lemonade glass is on the left! 


Case 2: Beyoncé is the liar. Then Jay-Z is the truth teller, and so since 
Beyoncé is lying, Jay-Z will say that the lemonade glass is on the left and he will 
be telling the truth. So, we again conclude that the lemonade glass is on the left! 


Since either Case 1 or Case 2 must be true and they lead to the same conclusion, 
Lawrence should confidently and happily choose the glass on the left! 


Exercise 3.6. Justify why, without loss of generality, we may assume that Beyoncé 
answers the question and that she says that Jay-Z will say that the lemonade glass 
is on the right. Namely, why are all of the other cases essentially the same as this 
case by reason of symmetry? 
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3.4*. Logic puzzles 


Proving statements in mathematics, just like solving riddles, is about finding a 
logical argument to ascertain the truth or falsehood of a statement. Logic puzzles 
are toy examples of such statements. Solving logical puzzles gives us a taste of the 
logical reasoning we do in mathematics. 


3.4.1. How many Terminators? Assume that at most one of the following 
three statements about is true. What can you deduce from this 
assumption? 

(1) Sarah owns at least two Terminators|4 
(2) Sarah owns less than nine Terminators. 
(3) Sarah owns at least nine Terminators. 

Let us figure out the solution to this puzzle. Suppose that (3) is true. Then 

this implies that (1) is true, contradicting that at most one of the statements is 


true. Therefore (3) is false. This tells us that (2) is true. This in turn implies that 
(1) is false. So what we can conclude is that Sarah owns at most one Terminator. 


Alternatively, since (3) is the negation of (2), we know that exactly one of (2) 
and (3) is true. Thus, if at most one of the three statements is true, then (1) is 
false. 


More abstractly, in the situation above we have three statements of the form: 
(1) P, 
(2) not Q (this is the negation of Q, which we formally define in 43.5.3] below, but 
it is what you think it is), 
(3) Q, 
where Q => P; that is, (3) > (1). 
Thus, if (3) is true, then (1) is true, a contradiction since we cannot have more 
than one statement being true. So (3) is false; i.e., Q is false; that is, (2) is true. 


Since at most one of the statements is true, this implies that (1) is false; i.e., we 
conclude that P is false. 


Remark 3.7. Observe that we actually proved that (1) and (3) are false, whereas 
(2) is true. In other words, we proved that “P is false” and “Q is false”. Now, our 
assumption that Q => P is equivalent to its contrapositive (not P) > (not Q); see 
{3.7 below. So we see that “P is false” is the stronger of these two statements since 
it implies “@ is false”. 


3.4.2. Ride My Seesaw 
“You can’t always get it when you really want it.” 


— From ‘Another Tricky Day’ by The Who 


Here is a rather tricky riddle (if you wish to avoid a lengthy analysis, you may 
skip it as it is not needed for any later discussion): 


4A |Terminator|is a cybernetic organism: living tissue over a metal endoskeleton. 
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Solved Problem 3.8. There are twelve bunnies on a beach in The Bahamas|' 
Eleven of them weigh exactly the same, and the twelfth bunny either weighs more 
or weighs less, but not the same as the other eleven. You have a(seesau) that you 
can put any number of bunnies on. Using the seesaw only three times, find out who 
is the odd-bunny out weight-wise and whether that bunny weighs more or less than 
the others. 


This riddle looks like it will succumb to some sort 0 method. 


But, because we are allowed only 3 weighings, it is tricky to figure out how exactly 
we should proceed with this idea! 


Figure 3.4.1. Six regular bunnies on a balanced seesaw. Tikz code adapted 
from the “improve the code for a rabbit” answer by samcarter_is_at_topanswers. 


xyz at Stack Exchange. Licensed under|Creative Commons Attribution-Share 
Alike 4.0 International License https://creativecommons.org/licenses/by-sa 


/4.0/deed.en) 


Solution. Label the bunnies 


(3.18) By, Bo, Bs, Cae) By. 
Let their weights be 
(3.19) Wi, Wa, W3, sey Wi, 


respectively. Call any one of the eleven bunnies weighing the same a regular 
bunny, and call the bunny weighing differently the funny bunny. 


In the following, the color coding is as follows: a green bunny is a bunny we 
know is regular, an orange bunny is a bunny we don’t know the type of, and a red 
bunny is a bunny we know is funny. So, hypothetically, if By is the funny bunny, 
then we have 

W, =We=::--=Wiu F Wie. 


A way to implement the aforementioned divide and conquer method is to divide 
a given group of bunnies of unknown type into three groups: 


(1) ones on one side of the seesaw, 
(2) ones on the other side of the seesaw, 
(3) ones not on the seesaw. 


A key observation is that since there is only one funny bunny, if we put equal 
numbers of bunnies on the two sides of the seesaw, then the seesaw is balanced 
if and only if all of the bunnies on the seesaw are regular. Here is how we can 
specifically proceed (this is not the only way): 


First weighing. Place bunnies 5), 82, 53, 8, on one side of the seesaw and 
bunnies B;, 8s, 87, 8s on the other side of the seesaw. The remaining bunnies 
Bg, Byo, By, By are not weighed. 


5One gets a rather different result if one uses)12 Monkeys| © 
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CASE 1: The first weighing is balanced: 
(3.20) W,4+W2+W34+W4=Ws+Woe+W7+ Ws. 


In this case, the bunnies B,, B2,..., Bg are all regular and hence colored green. 
Thus, the funny bunny must be one of the bunnies By, Bip, Bi, Biz. (Observe 
that since the bunnies B,, B2,..., Bg all weigh the same, from the point of view of 
weight they are interchangeable.) 


Case 1 second weighing. Place 65 and 29 on one side of the seesaw, and 
place 8); and B, on the other side of the seesaw (so that 5). is not on the seesaw). 


SUBCASE la: The second weighing is balanced: 
(3.21) Wo + Wi9 = Wi, + WY. 


So the bunnies Bo, Bio, Bi; are regular. This, together with B,, Bo,..., Bs all 
being regular, implies that Bj. is the funny bunny. Next, we need to determine if 
the funny bunny weighs more or less than the regular bunnies. 


Subcase la third weighing. So, for the third weighing, we put the funny 
bunny 82 on one side of the seesaw and your favorite regular bunny (of the other 
eleven) on the other side of the seesaw and see which of the two bunnies weighs 
more. This completes Subcase la. 


SUBCASE 1b: The second weighing is unbalanced with 
(3.22) Wo + Wi0 > Wii + Wi. 


Subcase 1b third weighing. Now place By on one side of the seesaw and 
Bio on the other side of the seesaw. 


SUBSUBCASE 1b(i): The third weighing is balanced: 
(3.23) Wo = Wo. 


In this case we conclude that By; is the funny bunny. By (8.22), we conclude 
that the funny bunny 5); weighs less than the regular bunnies. This completes 
Subsubcase 1b(i). 


SUBSUBCASE 1D(ii): 
(3.24) Wo > Wo. 
We leave Subsubcase 1b(ii) as an exercise. Hint: Show that Bo is the funny bunny. 
SUBSUBCASE 1D(iii): 
(3.25) Wo < Wo. 
We leave Subsubcase 1b(iii) as an exercise. Hint: Bo is a regular bunny. 
This completes Subcase 1b. 
SUBCASE tec: The second weighing is unbalanced with 
(3.26) Wo + Win < Wi1 + Wi. 
Subcase 1c third weighing. Again place [yo on one side of the seesaw and 
Bio on the other side of the seesaw. Similarly to Subcase 1b, we consider the three 


subsubcases (i), (ii), and (iii) depending on the result of the third weighing. We 
leave this as an exercise. 


3.4*, Logic puzzles 119 


This completes Subcase 1c and hence completes Case 1. 
CASE 2: The first weighing is unbalanced with 


(3.27) W,+W2+W3+W4>Ws+We+W7+ Ws. 


Then Bo, Bio, Bi, Big are all regular bunnies. 


Case 2 second weighing. Place 5), 52,55 on one side of the seesaw and 
place B;, B84, By on the other side of the seesaw (so that Bs, Bz, Bs are not on the 
seesaw). 


SUBCASE 2a: The second weighing yields 
(3.28) W,+We2+ Ws =W3+W4+ Wo. 


In this case, the bunnies B,, B2, Bs, B4, Bs are all regular. Hence the funny bunny 
is one of 


(3.29) Be Be Be. 


And by (8.27), the funny bunny weighs less than the regular bunnies. 


Subcase 2a third weighing. Place 6, on one side of the seesaw and place 
2B, on the other side of the seesaw. 


If W5 = W7, then Bg is the lighter funny bunny. 
If Wo 4 W7, then the lighter of the two Ds and 27 is the funny bunny. 
SUBCASE 2b: The second weighing yields 


(3.30) W,+We2+Ws >W3+W4+ Wo. 


In this case, either 6, or B» is the funny bunny—as, for every other possibility, 
(8.30) would contradict (8.27). So the funny bunny is heavier than the regular 
bunnies. The third step is to put 6; and 2 on the seesaw and see which bunny 
weighs more. 


SUBCASE 2c: The second weighing yields 
(3.31) W,+W2+Ws <W3+W4+ Wo. 


In this case, by comparing with (3.27), we see that either 2; or 5, is the funny 
bunny. And the funny bunny is heavier than the regular bunnies. Again, the third 
step is to put Bs and 4, on the seesaw and see which bunny weighs more. 


CASE 3: The first weighing is unbalanced with 


(3.32) W,+W2+W3+W4< Ws +We+W7+Ws. 


Exercise! This is very similar to Case 2. We can actually appeal to symmetry. 


The riddle above is a nice, albeit complicated, example of analyzing a problem 
by cases and subcases. 


Exercise 3.7. Fill in all the details that were omitted in the solution to Solved 
Problem B.8} 
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3.5. Logical connectives 


As we have seen, riddles give a pleasant entryway to logical thinking in mathemat- 
ics. In this section we begin our more formal discussion of the building blocks of 


logical thinking: |logical connectives) (or connectives for short). Since the reader 


no doubt already understands the words “or”, “and”, “not”, and “implies” at least 
in the working way we have encountered them in the first two chapters, our present 
discussion is a bit non-linear in order to introduce interesting examples to start. 


3.5.1. The logical connective “or”. We begin with: 


Definition 3.9 (The connective “or”). Let P and Q be statements. We say that 
the (compound) statement 
P or 


is true if at least one of the two statements P and Q is true. The connective “or” 


is also referred to as a 


Example 3.10. The statement 
(3.33) (e > 3) or (x > 3) 


is true since, although e > 3 is false, we have that 7 > 3 is true. 


In a formal mathematical sense, we can make the meaning of the logical con- 
nective “or” in Definition [3.9] more precise by defining it by a truth table. The 
truth table for the connective “or” is 


Table 3.5.1. The truth table for the logical connective “or”. 


P|Q|PorQ 
rie 
Pie | @ 
elt) @ 
F|F/] F 


For example, the third row says that if P is false and Q is true, then “P or Q” 
is true. This is actually the case for Example B.10] with P being the statement 
“e > 3” and Q being the statement “a > 3”. 


We also denote “or” by the symbol V, so that “P or Q” is written as 
(3.34) PVQ. 
In fact, we see from the truth table that the statement “P or Q” is false exactly 
when both the statement “P” and the statement “Q” are false. 

Observe that, given the statements P and Q, there are 4 = 2? possible truth 
values in total since for each of the 2 statements P and Q there are 2 possible 


truth values. Each of the rows in the truth table above represents 1 of these 4 
possibilities. 


3.5. Logical connectives 121 


Note also that “Q or P” has the same truth values as “P or Q”. The essential 
reason for this is that the truth values of “P or Q” in the second and third rows of 
Table [3.5.1] are the same. 


Here is an example of a true statement involving the connective “or”: 
Lemma 3.11. Leta and b be real numbers satisfying ab = 0. Thena=0 orb=0. 
Solved Problem 3.12. Prove Lemma [3-11] 


Solution. Let a and b be real numbers satisfying ab = 0. 


Case 1: a = 0. In this case we are done, since a = 0 implies a = 0 or b = of 


Case 2: a # 0. In this case, + is a well-defined real number and we may 


multiply the equation 0 = ab by it to obtain 
1 1 
(3.35) 0=--0=-—-ab=b. 


a a 


Now, since b = 0, we also conclude that a = 0 or b= 0. 


Since, for either Case 1 or Case 2 we have a = 0 or b = 0, we have proved the 
result. 


Exercise 3.8. Observe that the proof above is according to the following template. 
Statement: Suppose P. Then Q or R. 
Proof. Suppose P. 
Case 1: Assume Q. Then we are done. 
Case 2: Assume not Q. Prove R. 


Explain why this template (provided you can accomplish the goal of Case 2) proves 
the statement. 


The connective “or” is sometimes referred to as the “inclusive or”. On the 
other hand, an example from everyday life of the ‘exclusive or’, in the form of a 
question, is: 

“Would you like tea or coffee?” 


Most people would answer either “Tea, please” or “Coffee, please”, using the “ex- 
clusive or”, but a mathematician might answer “Yes, please!” using the “inclusive 
or”. 

So, to avoid funny answers, one should instead ask a mathematician the fol- 
lowing question: 


“Would you like only coffee, only tea, neither, or both? Please 
choose exactly one of these mutually exclusive options. Thank 
you!” © 

In this book, we will use: Either P or Q to denote the exclusive or. 


The properties of inequalities, addition, and multiplication provide nice exam- 
ples to understand logical connectives. Recall the following. 


°We take for granted this last implication, which is formally stated in Example 1) below. 
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Axiom 3.13 (Trichotomy law for inequalities). Let a and b be real numbers. Then 


exactly one of the following is true: 

(3.36) a<b, b<a, a=b. 

By definition, a > b if and only if b <a. Also by definition, a < b if and only if 
a<bora=b. Similarly, by definition, a > b if and only ifa>b ora=b. 


We have the following properties. 
Axiom 3.14 (Inequalities, addition, and multiplication). Let a, b,c be real numbers. 


(1) (Addition) Ifa <b, thena+c<b+e. 
(2) (Multiplication by a positive number) If a <b and c > 0, then ac < be. 
(3) (Multiplication by a negative number) If a <b and c <0, then ac > be. 


For example, if a < b, then 2a < 2b, but —2a > —2b. 
The following shows us how the multiplication axiom changes when we replace 
< and > by < and >, respectively. 


Solved Problem 3.15. Prove that ifa <b andc> 0, then ac < be. 


Solution. Suppose that a < b and c> 0. 

Case 1: a= b. In this case we have ac = bc, which implies that ac < bc, so we 
are done. 

Case 2: a # db. Since a < b, by definition we have a = b or a < b. Since a £ , 
we have that a = 6 is false, so that a < b is true. 

Subcase 2a: c = 0. In this case we have ac = 0 = bc, and we are done. 

Subcase 2b: c #0. Since c > 0, we have c > 0. Since we also have a < b, we 
can apply Axiom[8.14[{2) to conclude that ac < bc. Since this implies that ac < be, 
we are done. 


In conclusion, we have proved the result by cases. 
Exercise 3.9. Let a,b,c be real numbers. Prove: Ifa <b, thena+c<b+e. 


Exercise 3.10. Let a,b,c be real numbers. Prove: If c > 0 and ac < bc, then 
a<b. Hint: If c is a positive real number, then + is a well-defined positive real 
number. 


3.5.2. The connective “and”. We begin with: 


Definition 3.16 (The connective “and”). Given two statements P and Q, we say 
that the statement 
P and Q 


is true if both of the two statements P and Q are true. The connective “and” is 


also referred to as a\conjunction| 


If a > 0, then we say that a is non-negative. If a < 0, then we say that 
a is non-positive. Let us prove the following result with a conjunction in the 
hypothesis. 
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Lemma 3.17. Leta be a real number. If a is non-negative and a is non-positive, 
then a =0. 

Proof. By the hypothesis, we have 

(3.37) (a >0ora=0) and (a<0ora=0). 


Suppose for a contradiction that a > 0. Then, by (a < 0 or a = 0) and the 
trichotomy law (Axiom [3.13), we have a contradiction. Therefore, the statement 
a > 0 is false. This and (a > 0 or a = 0) being true imply that a = 0. 


So, being simultaneously non-negative and non-positive is not contradictory. 


This reminds us of Schrodinger’s cat 


Wanted, dead and alive: Schrédinger’s cat. 


All the difference a logical connective makes. Switch it and you get a/Bon Jovi 
song: |Wanted Dead or Alive 
We can summarize the definition of “and” by the truth table 


P|Q|PandQ 


ap | ae aT 
(3.38) T|F F 
ai F 
F|F F 


This truth table gives us a formal definition of the connective “and”. 


For example, the second row of the truth table says that if P is true and if Q 
is false, then “P and (” is false. We also denote “and” by the symbol A, so that 
“P and Q” is written as 


(3.39) PAQ. 


We see from the truth table that the statement “P and (Q” is true exactly when 
both the statement “P” and the statement “Q” are true. 


Here is another simple example of a true statement involving the connective 
66 ” 
and”: 


Lemma 3.18. Leta and b be non-negative real numbers such thata+b=0. Then 
a=0andb=0. 


Exercise 3.11. Prove Lemma [3.18] Hint: After stating the hypotheses, one can, 
for example, start the proof by “Suppose for a contradiction that a 4 0.” 


As a visual device to think about some logical statements, we represent these 
statements by Venn diagrams. In Figure|3.5.1} each statement is represented by a 
set. The logical connective “and” is represented by “intersection”, and the logical 
connective “or” is represented by “union”. The motivations for these representa- 
tions are given in Chapter Jon set theory. 
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e Q vad Q 


Figure 3.5.1. Left: Visualizing P and Q. Right: Visualizing P or Q. 


3.5.3. The logical connective “not”. Given a statement P, we say that the 
statement 
not P 
is true if P is false. 


For example, since the statement “a is an integer” is false, we have that the 
statement “not (7 is an integer)” is true. 


The connective “not” is also referred to as a|negation| 


The truth table for “not” is 


P| not P 
(3.40) T F 
F T 


We also denote “not” by the symbol —, so that “not P” is written as 
(3.41) AP. 
From the truth table we see that “P” is true exactly when “not P” is false. We 
also see that “P” is false exactly when “not P” is true. 

The negation of a negation is equivalent to the original statement. We can see 
this easily by the following truth table: 

P | not P | not (not P) 

(3.42) T| F | T 


F T F 


To see why the truth values in the third column are so, we may rename the state- 
ment “not P” as “Q”. Then we see that the second and third columns of (8.42) 
are the same as the truth table with P replaced by Q. That is, in the 
third column follows from the second column. 


Since the truth values of the first and third columns are the same, we have that 
the statement P is logically equivalent to the statement not (not P). That is, 
(3.43) P istrue exactly when not (not P) is true. 


For example, if we say, ‘'That’s not not true’, we mean “That’s true.” Perhaps, a 


more common version of this is: “That’s not untrue!” 


Example 3.19. Consider the statement: not (a < y). By the trichtomy law, this 
is equivalent to y < x or x = y; that is, y < x (equivalently, x > y). 


3.5. Logical connectives 125 


Exercise 3.12. Let x be a real number. Prove: If x #0, then x? > 0. Hint: Since 
x is non-zero by assumption, we have x > 0 ora <0. Now use Axiom [3.14 


not P 


Figure 3.5.2. Visualizing the negation. 


In Figure[3.5.2| a statement P is represented by a set, and its negation (not P) 
is represented by the “complement” of the set. See Chapter [5] for the motivation 
for this representation. 


3.5.4. Tautologies. Now that we have discussed the logical connectives “or”, 
“and”, and “not”, we can consider statements consisting of two or more statements 
joined by these logical connectives. We start with, no pun intended, trivialities. 


Consider the statement 
‘2B or (not 2B)”. 


To most people, this is a famous poetic sentence of Shakespeare, but we will rather 
think of it as a mathematical statement! In fact, this sentence is Much Ado About 
Nothing: it is a tautology, which we will now explain. 


A tautology) is a compound statement in the form of a “formula” of state- 
ments which is always true independent of the truth values of the statements it is 
comprised of. 


We can see that the compound statement “2B or (not 2B)” is a tautology by 
considering its truth table: 


2B | not 2B | 2B or (not 2B) 
T F a 
at 


So, no matter what the truth value of 2B is, the statement “2B or (not 2B)” is 
true, and hence it is a tautology! So Shakespeare was correct after all! 
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Example 3.20. The compound statement 
(P or Q) or ((not P) or (not Q)) 
is a tautology by the following truth table: 


|P|Q|PorQ | not P | not Q | (not P) or (not Q) | (P or Q) or ((not P) or (not Q)) | 


ae |) a0 ae F F F Oe 
T|/F ay F T at als 
1 | we ale T F a le 
F|F F T ly T ae 


Some |Yogi-isms|, i.e., statements by are tautologies when suitably 


interpreted. For example, 


It ain’t over ’til it’s over. 
It’s déja vu all over again. 
You can observe a lot by watching. 
Of course, this is all tongue in cheek. We’ve often watched while not observing. 
Many Yogi-isms are contradictions (see the next subsection): 
Nobody goes there anymore. It’s too crowded. 
I really didn’t say everything I said. 
What do you make of the following Yogi-isms? Are they tautologies, contra- 
dictions, or neither? 


A nickel ain’t worth a dime anymore. 
Always go to other people’s funerals; otherwise they won’t go to yours. 
If you can’t imitate him, don’t copy him. 


So, what is the goal of mathematics? Starting with your premises, use a bunch 
of tautologies to logically deduce your desired conclusion! 


Example 3.21. Let P and Q be statements. We have the following tautologies: 
(1) P or (not P). 
(2) (P or Q) or ((not P) or Q). 
(3) (P and Q) or ((not P) or (not Q)). 

Exercise 3.13. Prove that the statements in parts (2) and (3) of Example are 


tautologies. 


3.5.5. Contradictions. On the other hand, some trivialities can be oxymoronic. 
A contradiction is a compound statement in the form of a “formula” of statements 
which is always false independent of the truth values of the statements it is com- 
prised of. 


For example: 
Proposition 3.22. The non-Shakespearean statement 
“2C and (not 2C)” 


is a contradiction; i.e., it is false (independent of what the statement “2C” is). 


3.5. Logical connectives 127 


Proof. We prove this by constructing a truth table. Let P = 2C’. We have 


P | not P | P and (not P) 
(3.44) T | F | F 


F T F 


Here, we used the truth tables for “and” in (8.38) and for “not” in (8.40). In 
particular, we used the truth values in the second and third rows of (8.38). 


Most of the time, when we derive a contradiction in a proof, it is a statement 
of the form “P and (not P)”. 


Historical note: The 1992 movie ‘Wayne’s World” made popular the use of the 


word “not” at the end of sentence. Here is an example: 
“I find this lecture interesting, not!” 
Here is an interesting self-contradictory quote: 
“We learn from history that we do not learn from history.” ~|Georg W. F. Hegel 
Example 3.23. The compound statement 
(P or Q) and ((not P) and (not Q)) 


is a contradiction by the following truth table: 


| | Q | PorQ | not P | not Q | (not P) and (not Q) | (P or Q) and ((not P) and (not Q)) | 
‘A || ae a F F F F 
AW || 1 al F T F F 
F/T Ay T F F F 
F|F F T T T F 


Example 3.24. Let P, Q, and R be statements. We have the following contradic- 
tions: 


(1) P and (not P). 
(2) (P and Q) and ((not P) and R). 
(3) (P and Q) and ((not P) or (not Q)). 


Exercise 3.14. Prove that the statements in parts (2) and (3) of Example [3.24] are 
contradictions. 


Exercise 3.15. Is 
(3.45) (P or Q) and ((not P) or (not Q)) 


a contradiction? Hint: What happens for example if P is true and Q is false? 


In §3-8] below we will discuss proof by contradiction! Namely, suppose we are 
trying to prove the following statement: ‘(quod erat demonstrandum) . We begin by 
assuming that quod erat demonstrandum is false. If, from this assumption, we can 
prove a contradiction, then we are done. That is, we will have proved that quod 
erat demonstrandum. is true! 
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3.6. Implications 


“In mathematics you don’t understand things. You just get used to 


them.” — John von Neumann 


In this section we discuss the familiar notion of implication, which is a logical 
connective and which we have been using throughout the book up to now, in a 
more formal way. 


3.6.1. A literary introduction to implications. In the folk tale Cinderella, Cin- 
derella’s stepmother tells her: 


If you complete your chores, then you can go to the ball. 


This is alconditional statement) or implication, that is, an if-then statement! 


Such a statement has the general form: 
(3.46) P=>Q, 
where P and @ themselves are statements. Recall that a statement is either true 
or false. 

What is the mathematical meaning of an implication? Firstly, here is what it 
is not: causation. For example, consider the sentence: 


If it rains, then my porch gets wet. 


In this sentence we see causation: the rain is what causes the porch to get wet. On 
the other hand, in mathematics, we view implications differently: 


An implication does not imply (pun intended) cause and effect. 
Given two statements P and Q, we say that: 


(1) The implication P = Q is false if P is true and Q is false. 


(2) The implication P > Q is true if any one of the following cases is true: 
(a) P and Q are true. 
(b) P is false and Q is true. 
(c) P and Q are false. 


Let us first see if such a definition of implication is reasonable. For fun, consider 
first the implication above from the story of Cinderella. Let us suppose, according 
the stepmother’s wishes, that Cinderella will never “P” = complete her chores and 
will never “Q” = go to the ball. In this case, both P and Q are false, but by rule 
2(c) above, the implication P = Q is true! 


Example 3.25. Consider the following implication: 


If the moon is made of green cheese, then the earth is flat. 


Here, again, both the hypothesis and conclusion are false (let us make the assump- 
tion that we are stating facts and not opinions!), and hence the implication is true! 
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It is useful to illustrate the meaning of implication by the truth table defining 
it as a logical connective: 


HHpaH 


P| Q| 
1) Te 
(3.47) T|F 
a 
F|F | 


This truth table defines in a formal sense the implication =. From this truth table 
we see that the truth value of P = Q depends only on the truth values of P and 
Q, as is the case with the other logical connectives that we have discussed. 


Observe that if Q is true, then the implication P => Q is true independent of 
whether P is true or false. For example, the statement 


(3.48) (P=) =1) 


is true simply because 1? = 1 is true. In a sense, the interesting aspect of an 
implication P = Q is when P is true, and in this case when the implication is true, 
we then know that Q is true. 


In a logical argument, when we write 
“If P, then Q”, 


we mean that if P is true, then Q is true. In particular, if P is false, then nothing 
is said about whether Q is true. Other ways of writing this implication are: 


P implies Q. 
Q if P. 
P only ifQ (we will not use this much). 


Example 3.26. We have the following examples of tautologies involving implica- 
tions: 


(1) P+ (Por Q). 
(2) (P and Q) > P. 

(3) (P and Q) => (P or Q). 
(4) (P and Q) > (Q and P). 
(5) (P or Q) © (Q or P). 


Solved Problem 3.27. Prove that the statement in part (1) of Example 3.26] is a 
tautology. 


Solution. We give two proofs. 


(1) Proof by logical inference. Suppose P is true. Then P or Q is true, since 
we know that at least P is true. Therefore the implication P > (P or Q) is true. 
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(2) Proof by truth table. We have the following truth table: 
P| Q|PorQ|P=(PorQ) 


a if T 
(3.49) T|F a a. 
F/T T T 
F|F F af 


Since the truth values in the last column are all T’s, we have proved the result. 


Exercise 3.16. Prove that each of the statements in parts (2)-(5) of Example [3.26 


is a tautology. 
P P Q 


Figure 3.6.1. A Venn diagram visualization of Example 1). 


The truth of an implication P => R is represented by a Venn diagram as the 
set P being a subset of the set Ri In Figure 3.6.1} since the set P is a subset of 
the union of P and Q, this represents the implication P = (P or Q) being true. 
So this gives a visualization of part (1) of Example B.26] 


Exercise 3.17. Can you draw Venn diagrams to illustrate the tautologies in parts 


(2)-(5) of Example B.26)? 


3.6.2. The converse. The title of this subsection does not refer to the manu- 
facturer of a shoe! 


Given an implication 


(3.50) P=, 

its implication is 

(3.51) O=> P: 

In other words, the converse of the statement “If P, then Q” is the statement 
“If Q, then P.” 


It is a rather easy fact that if an implication is true, then its converse is not 
necessarily true. So, in general, as in this case, an implication does not have the 
same meaning as its converse. A simple example is the following implication: If n 
is an even integer, then n is an integer. Clearly this implication is true, whereas 
its converse is false. 


*See Chapter[5] for a formal definition of the notion of subset. 
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The converse of “If an integer a is even, then a? is even” is 
“Tf a? is even, then a is even” 


(being understood that a is an integer). For this implication, both it and its converse 
are true. 


Example 3.28. Consider the following true statement: 
If x > 0, then x? > 0. 

Its converse is 
If x? > 0, then x > 0. 


However, the converse is false since —1 < 0 and (—1)” > 0. 


The following is the converse of the implication in Exercise B.12] 


Exercise 3.18. Prove the following: Let x ER. If x? >0, thenx £0. 


We have the following implication: 


Exercise 3.19. Let a and b be real numbers. Prove that if (a > 0 and b > 0) or 
(a <0 and b < 0), then ab > 0. 


And its converse: 


Exercise 3.20. Leta and b be real numbers. Prove that if ab > 0, then (a > 0 and 
b> 0) or (a <0 andb< 0). 


3.6.3. Transitivity of implication. Having seen some examples of implications, 
we now consider one of its fundamental properties. 


Lemma 3.29. Suppose P=> Q and Q = R are both true. Then P => R is true. 


See Figure B.6.2] for a visualization of the transitivity property of implication. 


Exercise 3.21. Prove Lemma |3.29 


Figure 3.6.2. Visualizing the transitivity of implication: small = medium = large. 
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3.6.4. The biconditional. By combining an implication and its converse using 
the logical connective “and”, we obtain the following. 


Definition 3.30. Let P and Q be statements. The statement P = Q means that 
(P => Q) and (Q => P). That is, the implication (P = Q) and its converse are 


true. This is called a\biconditional| 


In other words, we have both: 
“Tf P is true, then Q is true” and “If Q is true, then P is true.” 
In this case, we also say that “P if and only if Q” or “P is equivalent to Q.” 
Example 3.31. The following true statements are each biconditional: 
(1) For any integer n, n is even if and only if n? is even. 
(2) For any real number z, x? > 0 if and only if « 4 0. 
(3) For any prime p, p is odd if and only if p > 2. 


Exercise 3.22. Prove, using either a truth table or a logical argument, that each 
of the following statements are tautologies: 


(1) P or (not P). 


(2) (P>Q) = (not Q => not P). 

(3) ((not P > Q) and (not P = not Q)) => P. 
(4) not (P and Q) = (not P or not Q) 

(5) ((P or Q) and (P > R) and (Q=> R)) SR. 


3.6.5. De Morgan’s laws. Particularly useful equivalences of compound state- 
ments are the following. In a sense, negating an “or” yields an “and”, and vice 
versa. 


Solved Problem 3.32 (De Morgan’s laws). Prove that 

not (P orQ) <= (not P) and (not Q) 
and 

not (P andQ) <= (not P) or (not Q) 


are tautologies. 


Solution. We can see that the first statement is a tautology by considering its 
truth table in Table [3.6.1] 


Table 3.6.1. Truth table for not (P or Q) <— (not P) and (not Q). 


| P and Q | not (P and Q) | not P | not Q | (not P) or (not Q) 


Q 
By 
F 
T 
F 


yy | ty 
gomyy 
HAH 
Hay 
BW yy 
WeaAsy 
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Evidently, we see that the truth values under the statements “not (P and Q)” 
and “(not P) or (not Q)” match (indeed, they are from top to bottom: F, T, T, 
T), so the statement that they are equivalent is a tautology, i.e., always true. 

Similarly, one can prove the second statement. We leave this to you, dear 
reader! 

We can alternatively prove that the first statement is a tautology by a logical 
argument as follows. 

Suppose: not (P and Q). This is equivalent to: P and Q is false. By the 
definition of the logical connective “and”, this means that at least one of P and Q 
is false. Equivalently, P is false or Q is false (we always use the inclusive “or’”). 
That is, (not P) or (not Q) is true. We have established the desired equivalence. 


See Figure for a visualization. 


69} _ 


PorQ not (P or Q) (not P) and (not Q) 


Figure 3.6.3. Visualizing one of De Morgan’s laws. On the far right: 
(not P) consists of the union of P — Q and (not (P or Q)), and (not Q) 
consists of the union of Q — P and (not (P or Q)). We see that their intersec- 
tion (corresponding to “and”) is (not (P or Q)). 


Exercise 3.23. Let a and b be real numbers with a < b. Let x be a real number 
such thata < « < b is false. Prove thatx <a ora > 0b. Hint: a< ax < b means 
z>aandxz <b. 


3.6.6. Rules of inference. In this subsection we discuss slightly more formally 
some of the logical reasoning concepts we have encountered thus far. However, the 
overall style of this book is to consider logic more informally. So we do not go into 
formal logic in any detail. 


Propositional logic) is the study of statements (a.k.a. propositions), relations 


between them, and logical arguments involving these statements. 


is the following 


Suppose that P is true and that P => Q is true. Then we can 
deduce that Q is true. 


Example 3.33. Here is an example of modus ponens: 


2 divides x”. If 2 divides x”, then 2 divides x. Therefore: 2 divides 
z. 


Here, we assume that x is an integer whose square is divisible by 2. Then we state 
the fact that if x? is divisible by 2, then so is x. By modus ponens, we can conclude 
that x is divisible by 2. 
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{Modus tollens| is the following rule of inference: 
If P, then Q. Not Q is true. Therefore: not P is true. 
Example 3.34. Here is an example of modus tollens: 
If 3 divides both a and 8, then 3 divides 5a+ 7b. 3 does not divide 
5a+ 7b. Therefore: 3 does not divide both a and b. 


Modus tollens is related to contraposition, which is the following rule of infer- 
ence (compare with the contrapositive defined in §3.7] below): 


P= Q is true. Therefore: not Q > not P is true. 
Example 3.35. Here is an example of contraposition: 


If x is even, then x? is even. Therefore: If x? is odd, then z is odd. 


3.7. Contrapositive 


I am not, therefore I do not think. 
Goodness! The statement above must be true! © You will know why from this 


section. 


3.7.1. The definition of contrapositive and examples. As we have seen, for an 
implication, its converse is not logically equivalent. However, we can add negation 
into the mix to rectify this. 


Definition 3.36. The |contrapositive’ of the statement P => Q is the statement 
(3.52) (not Q) => (not P). 
Example 3.37. Let the domain of discourse be R. Consider the implication: 
If « > 0, then x? > 0. 
The contrapositive of this statement is the implication: 
If 2? < 0, then x < 0. 


Observe that the following true implication is a stronger statement than the 
contrapositive above: 


If x? <0, then x = 0. 
Equivalent to this implication is its contrapositive: 
If « £0, then x? > 0. 
3.7.2. Equivalence of an implication with its contrapositive. This is a conse- 
quence of Truth Table B-7.1] Since the truth values of the third and sixth columns 


therein are the same, we have the logical equivalence of an implication P > Q with 
its contrapositive (not Q) > (not P). 


Exercise 3.24. Let x be a real number, and let k be a positive integer. Prove that 
if c* >0, then x £0. 
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Table 3.7.1. The truth table for an implication and its contrapositive. 


P| Q | P= Q | not P | not Q | (not Q ) > (not P) 
T/T T F F T 
TF F T F F 
F | T T F T T 
F|F T T T T 


3.7.3. Contrapositives of some results that we have proven. In Chapter [I] 
we actually considered a number of implications. We now consider some of their 
contrapositives. Recall from Corollary [.30]that the square of an odd integer is odd 
and the square of an even integer is even. 


By taking the contrapositives of these implications, we obtain: 
Corollary 3.38. We have the following implications regarding squares of integers: 
(1) Ifa is an integer such that a? is even, then a is even. 
(2) If b is an integer such that b? is odd, then b is odd. 
By combining this with Corollary [1.30] itself, we get the biconditional state- 
ments: 
Corollary 3.39. Let a be an integer. Then: 
(1) a is even if and only if a? is even. 
(2) a is odd if and only if a? is odd. 
Exercise 3.25. Using results from Chapter [I] give short proofs of Corollaries [3.38] 
and [3.39] 


Exercise 3.26. Prove that if the sum of two integers is even, then either both 
integers are even or both integers are odd. 


not Q 


no 


Figure 3.7.1. Visualizing the contrapositive (not Q) = (not P) of the impli- 
cation P => Q. The blue and green regions together represent @, and the red 
and yellow regions represent (not P). 
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Exercise 3.27. Prove that if the product of two integers is odd, then both integers 
are odd. 


Recall that. Theorem [1.27] says that the sum of two odd integers is even. So, 
the contrapositive of this statement is: 


If the sum of two integers is not even, then they are both not odd. 
That is: 
If the sum of two integers is odd, then they are both even. 


Oops (ahem), where did we go wrong?! When negating the implication, we incor- 
rectly negated the hypothesis. The contrapositive should read: 


If the sum of two integers is not even, then they are not both odd. 


What a difference in meaning interchanging the words “both” and “not” makes! 
In other words: 
If the sum of two integers is odd, then at least one of them is even. 
Although this is a nice and true statement, we can do better (we leave this as an 
exercise): 


If the sum of two integers is odd, then exactly one of them is even. 


3.7.4. The contrapositive of the contrapositive. Given a conditional state- 
ment P > Q, its contrapositive is (not Q) => (not P). In turn, the contrapositive 
of this second implication is 


(3.53) (not (not P)) = (not (not Q)). 


Since the negation of the negation of a statement is the statement itself, we have 

that this implication is equivalent to the original implication 

(3.54) P=Q. 

Of course, we already knew this from the transitivity of logical equivalence, but 

here we again see it from a slightly different angle (the negation of the negation). 
Now, let us return to Descartes’s statement “I think, therefore I am”. This 

says: I think > I am. So its contrapositive is: Iam not = I do not think. So the 

implication quoted at the beginning of this subsection is indeed true! Of course, 


you may say that the person making this statement is thinking and hence there is 
an apparent contradiction to them not existing! 


Let us now think about this more carefully. What Descartes is actually saying 
is the following: 


Premise: I think. 
Implication: If I think, then I am. 
Conclusion: I am. 


This is an example of 


However, for brevity Descartes’s formatted the statement as 


Premise, therefore Conclusion. 


3.7. Contrapositive 137 


Indeed, the Implication is implicit in his statement. 


The contrapositive of the implication in Descartes’s argument is: 


Contrapositive: If I am not, then I do not think. 


In order to obtain the conclusion “I do not think”, we need to assume the premise 
“Tam not.” However, by the very making of this premise, we are existing. © 


On the other hand, the converse of Descartes’s statement is: 
I am, therefore I think. 


We like to think of this as a false implication. But we can debate this. For example, 
this statement is self-referential. So, is the entity making this statement thinking? 
However, the entity might be a computer. Do computers think? Also, are self- 
referential sentences necessarily statements? And we can go on. In any case, our 
noodle is baked! Fortunately, we will not consider such sentences as mathematical 
statements. 


3.7.5. Powers in Mersenne primes are prime. The following result, whose 
proof is formulated in terms of contraposition, certainly helps in the search for 
Mersenne primes. If you did Exercise [1.10] then you have already seen the proof, 
which is accomplished by proving the contrapositive implication. 


Theorem 3.40. [fn is a positive integer such that 2” —1 is a prime, then n itself 


is prime. 


Proof. Firstly, if mn = 1, then 2”. — 1 = 1 is not prime. Therefore we may assume 
that n > 2. 


We prove the contrapositive. Suppose that n > 2 is a composite number; that 
is, n = ab, where a,b > 1 are integers. We have 


(3.55) Q” —1= (27)? —1. 

Now recall the general formula (see (2.21)) 

(3.56) a? —1=(e2—-1)(22 142° 7 4..-42+41). 

Taking x = 2°, which is an integer, we obtain 

(3.57) g” —1 = (27)?_-1 = (27 — jer 4 gub-2) 4p gay 1). 
Observe that the factor 2° — 1 > 1 since a > 1. Similarly, the factor 


DORN) ge Qh 2) eos BO 1 Sa 


since a,b > 1. We conclude that 2” — 1 is a composite number. 
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3.8. Proof by contradiction 


Ted: What if we were lying? 
Bill: Why would we lie to ourselves? 


Checkmate! Thus describes the essence of proof by contradiction, a.k.a. |{reductio| 
(Latin for “reduction to absurdity”). We will consider absurdity as a 
synonym for contradiction. So for reductio ad absurdum, we start with assuming 
what we want to prove is false and deduce a contradiction from this assumption. 
Our only conclusion can be that what we want to prove is true! 


In other words: 


Mathematician Ted: Suppose that we were lying. 
Mathematician Bill: This leads to a contradiction. 
Dear reader: Therefore Bill and Ted were not lying! 


We have the following lovely quote: 


Reductio ad absurdum, which Euclid loved so much, is one of a mathe- 
matician’s finest weapons. It is a far finer gambit) than any chess play: 
a chess player may offer the sacrifice of a pawn or even a piece, but a 
mathematician offers the game. 


— G. H. Hardy, A Mathematician’s Apology 


3.8.1. Proofs by contradiction that we have seen. For example, the following 
proofs which we have already encountered in Chapter [IJare proofs by contradiction. 
We encourage the reader to review that these proofs are indeed by contradiction. 


1) The proof of Lemma [L.5] characterizing when an integer at least 2 is not a 
prime. 


2) The proof of Lemma[L.8]that any divisor of a positive integer is at most that 
integer. 


3) The proof of Lemma [L.21] that 3 is an odd integer, and its generalization to 
Lemma [1.22 


4) Theorem that the sum of an even integer and an odd integer is odd. 


5) The proof of Theorem that any factorization of 1 by positive integers is 
trivial. 


6) In the proof of Corollary [1.38] we proved the statement that if n? = 1, then 
n#0. 
7) After Definition [51] we proved that two distinct primes are coprime. 


You’ve probably heard the expression (paraphrasing): 


There are many ways to be wrong, but there is only one way to be 
right. 
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Similarly, there are usually many ways to prove something wrong whereas there are 
usually few ways to prove something right. This explains why proof by contradiction 
is so effective: 


There are few ways to prove a statement, but there are many ways 
to disprove the negation of the statement. 
Maybe we are serious; maybe we are not! © 
In any case: 


Proof by contradiction is one of the most important methods of 
proof in mathematics! 


Solved Problem 3.41. Prove that if a divides b and if a does not divide c, then 
b does not divide c. 


Solution. Suppose for a contradiction that a divides b and that a does not 
divide c, but b divides c. Then, since a divides b and 6 divides c, by the transitivity 
of division, we obtain that a divides c. However, this contradicts a does not divide 
C. 


Mini-analysis of the solution. The template of the statement is: 
If P and (not Q), then (not R). 


For a contradiction, we assumed that P, (not Q), and R are true. We derived from 
this that Q is true. Since this contradicts (not Q), the statement is proved. 


Exercise 3.28. Prove Corollary from Theorem (We didn’t write out 
the detailed proof earlier.) 


Exercise 3.29. Prove that if a > b> 0 are integers, then a is not a divisor of b. 


Exercise 3.30. Let a,b,c,m,n be non-zero integers. Prove that if c does not divide 
ma-+nb, then c does not divide a, or c does not divide b. 


Exercise 3.31. Generalize Proposition to the following statement: Suppose 
that d divides both a and b. Prove that if d does not divide c, then there do not 
exist integers m and n that solve the equation 


(3.58) am + bn =. 
3.8.2. Confounded by conflating contradictions with contrapositives. Proof 


by contrapositive is a special case of proof by contradiction. The template for a 
proof by contradiction of an implication P > Q is: 


Suppose P and (not Q). Yada, yada, yada. Then R and (not R). 


Here, R is a suitable statement. On the other hand, the template for a proof by 
contrapositive is: 


Suppose (not Q). Yada, yada, yada. Then (not P). 


So, we can think of proof by contrapositive as the special case where R = P of 
proof by contradiction. So, when one proves something by contrapositive, it may 
seem natural to think of it in terms of proving by the more general method of 
contradiction, although characterizing the proof as by contrapositive is more apt. 
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3.8.3. Solving, or not solving(!), linear equations over the integers. The 
following is an example of a linear Diophantine equation, which we will study 
in earnest in §4.5] For this example, we prove the non-existence of solutions by 
contradiction. 


Proposition 3.42. There do not exist integers m and n that solve the equation 
(3.59) 15m + 18n = 1024. 


Proof. Here is the idea of the proof: Suppose, for a contradiction, that the propo- 
sition is false. Then there exist integers m and n that satisfy the equation 


(3.60) 15m + 18n = 1024. 
What is wrong with this picture? 
We observe (here, if you don’t know your powers of two, is where we may have 
made it difficult for you!) that 
(3.61) 1024 = 2)°. 


In particular, the only prime divisor of the right-hand side of equation (3.60) is 2. 
In particular, 1024 is not divisible by the prime 3. However, the left-hand side is 
divisible by 3 since 


(3.62) 15m + 18n = 3- (5m + 6n). 


So, zf a solution exists, we find that there exists a number, namely the number 
1024, which is at the same time divisible by 3 and not divisible by 3, which is a 
contradiction! Thus, no solution to (8.60) exists. 


The proposition leads to the following (easy) generalization, which we leave as 
an exercise to the reader: Suppose that integers m and n satisfy an equation of the 
form 


(3.63) Am+ Bn=C, 


where A, B,C are integers. If d is a common divisor of A and B, then d divides C. 
In particular, the greatest common divisor 


g := gcd(A, B) divides C. 
We conclude that if g does not divide C’, then (3.63) has no integer solutions. 


Example 3.43. The linear Diophantine equation 51m + 68n = 1699 has no solu- 
tions. Indeed, gcd(51, 68) = 17, whereas 17 does not divide 1699. 


3.8.4. Results about the greatest common divisor. 


Solved Problem 3.44. Prove that if an integer n > 2 is not a prime number, 
then gcd(n,(n—1)!) > 1. In other words, its contrapositive: If an integer n > 2 
satisfies gcd(n, (n — 1)!) = 1, then n is a prime number. 


Solution. Suppose that n > 2 is not a prime number. Then there exist integers 
1<a,b<nsuch that n = ab. It is easy to see that a divides both n and (n — 1)!. 
Thus, 


gcd(n,(n—1)!) >a>1. 
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The interested reader may wish to prove that if n > 2 is a composite number, 
then gcd(n, (n — 1)!) =n. 


Exercise 3.32. Let p be a prime and let a be an integer. Prove that if gcd(a, p) > 1, 
then p divides a. 


How does this exercise compare to Exercise [1.34]? 


Exercise 3.33. Let n > 2 be an integer. Prove the converse of Solved Problem 
3.44} If gcd(n,(n —1)!) > 1, then n is not a prime. Hint: You may quote a result 
about the case where n is a prime. 


3.8.5. The square root of 2 is irrational! 


Question: What is an irrational number? 
Answer: A number that thinks that two plus two is five! 


Sadly, we’ve tried over and over again to teach the square root of two to think 
logically, but alas: 


Theorem 3.45. V2 is an irrational number. 


Proof. For a contradiction(!), suppose that V2 is a rational number. Then there 
exist integers a and b such that 


a 
2=— >, 
V2 5 


By Theorem [L.50)] we may assume that gcd(a,b) = 1. In particular, a and 6 cannot 
both be even. 


Now 2 = . so that 
(3.64) 2b? =a’. 


By Corollary [B:38] since a? is even, a is even. Now, since a is even, there exists 
q € Z such that a = 2q. Hence 2b? = (2q)* = 4¢?, so that 


(3.65) b? = 2q?. 


Since b? is even, b is even. That both a and } are even contradicts that gcd(a, b) = 1. 
We conclude that 2 is irrational. 


This is an iconic proof. 


Remark 3.46. If we do not assume that a and b are coprime, we can prove the 
irrationality of /2 by the method of The idea is that a and b have 
to be even, and hence we can replace them by the integers a; = a/2 and b; = b/2. 
But then a, and 0 are also both even, so in turn we can replace each of them by its 
half, retaining the property of being integers. We would then be able to continue 
this process of dividing by two|ad infimum] (forevermore), yielding a contradiction 
(why?)! Hint: Look for the well-ordering principle later in the book. 
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Figure 3.8.1. For a contradiction we assume that V2 is rational. 


Exercise 3.34. Prove that if x is a positive real number satisfying x? = 2, then x 
is trrational. 


3.8.6. The square root of any prime number is irrational. With the same 
logic as in the proof by contradiction in the previous section, we can prove the 
following. 


Theorem 3.47. If p is a prime, then \/p is irrational. 


The key to this is the following divisibility fact (see Corollary below). 


Lemma 3.48. If a prime p divides the square a? of an integer, then p divides a. 


Given this fact, we leave it to the reader to prove Theorem |3.47 


Exercise 3.35. Prove that if p is a prime, then ,/p is irrational. Hint: Use Lemma 
[3.48] Note that Theorem[B.51] is a generalization of this exercise, but we would like 
you to go through the logic directly instead of appealing to the theorem. 


Just like the fact that square roots of primes are irrational, we have that m-th 
roots of primes are irrational. We consider the case of the prime 3. 


Solved Problem 3.49. Let m > 2 be an integer. Prove that 3'/™ is irrational. 


Solution. Suppose for a contradiction that 3!/™ is rational. Then there exist 
coprime integers a and b such that 


m 


(3.66) gi/m — 7 so that 3 = - 


Thus, 03 = a™, and hence 3 divides a”. Since 3 is a prime, this implies that 3 
divides a (this will be proved in the next chapter). Thus, there exists an integer k 
such that a = 3k. Substituting this into the previous equation, we obtain 

(3.67) essa" Sa"k", so thet b= 3°" k™ 


Since, m—1 > 1, this implies that 3 divides b”. Therefore, 3 divides 6, which 
contradicts that a and 6 are coprime. 
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3.8.7. When square roots are irrational. A semiprime is the product of 
distinct primes. Their square roots are also irrational. 


Solved Problem 3.50. Let p and q be distinct primes. Prove that ,/pq is irra- 
tional. 


Solution. Suppose for a contradiction that ,/pq is rational. Then there exist 
coprime integers a and b such that 
2 


_ oa a 
(3.68) Jp¢ = 5 8° that pq = rom 


Thus a? = b?pq. In particular, p divides a?. Since p is a prime, this implies that p 
divides a. Thus, there exists an integer k such that a = pk. Substituting this into 
the previous equation, we obtain 


(3.69) p’k?a2 = b7pq, sothat pk?a? = b7¢. 
In particular, p divides b?q. Since p and q are distinct primes, we have that p and q 
are coprime. By Theorem [4.23] below, this implies that p divides b?, which in turn 


implies that p divides b. Since p divides both a and 6, we have gcd(a,b) > p > 1, 
which contradicts that a and b are coprime. 


The following (completely general) result can actually be proved with an argu- 
ment similar to the previous ones. It says that the square root of a positive integer 
is either an integer or irrational. 


Theorem 3.51 (Square roots are integers or irrational). Let n be integer of the 
form 


(3.70) n= pe ps? + pr, 
where py < pg <-++: < pp are primes and ky, ko,...,k, are positive integers. If at 
least one of the integers ki, ko,...,k, ts odd, then \/n is irrational. 


Otherwise, if ki, k2,...,kp are all even, then ./n is an integer. 


Proof. Let 2 be such that k; is odd. Write k; = 20; +1, where @; is a non-negative 
integer. We have 


(3.71) n= (pi')?m, thatis, /n=pf/m, 
where 

ki ky . 
(3.72) m= pr py Bide < pre 


It suffices to prove that \/m is irrational. Suppose, for a contradiction, there exists 


a rational number in lowest terms ¢ satisfying 


2 
(3.73) (F) =m. 
Then 
(3.74) a? = bp... pit! ppt? per. 


This implies that p; divides a?. Now, by Lemma[3.48] we have that p; divides a. 
We write 


(3.75) a = Cpj. 
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By substituting this into equation (3.74) and dividing both sides by p;, we obtain 


kia ki bp 
(3.76) op, = 0 ph +p py pe. 
Here, we have to use a believable result from down the road (see Theorem [4.23] and 
Lemma below): Since p; divides b? times a product of primes all of which are 
distinct from p;, we have that p; divides b?. Now we can apply Lemma[B.48] again 
to obtain that p; divides b. We have proved that p; > 1 is a common divisor of a 

a 


and 6, contradicting the assumption that the fraction ¢ is in lowest terms. 


3.8.8. Thinking irrationally about roots ©. Suppose a|stranger|comes up to 
you and blurts out: 


Mesdames et Messieurs, there exist positive real numbers x and y 
that are both irrational and such that x¥ is rational. 


How do you react to Etranger’s assertion? Are they saying gibberish (maybe 
they Forgot About Dre?) or is there something to it? 

Since there are lots of irrational numbers, we might hope that the stranger’s 
statement is true. Perhaps we can let x be the unique positive real number satisfying 
av? = 2. Then, all we have to do is prove that x is irrational. Now, « = Qu/v2 
so maybe it isn’t that difficult. But, alas at this point we seem to be stuck. So, 
apparently, we need to explore another path. 

Since, by now, we have been brainwashed into thinking that proof by contradic- 
tion is a powerful method, let us explore this avenue. Suppose for a contradiction 
that for all positive irrational real numbers x and y, the number x” is irrational. 


2 
Let x = Jo? Then, by our assumption, z is irrational. Now what can we say 
about 


(3.77) av? 2 


Have you figured out the proof of the stranger’s assertion? 
Is this related to the original path we tried to take? 


Here is a way to think about the preceding logical argument visually. Let Q 
denote the rational numbers, and let I denote the irrational numbers. Define the 
function f : R* > Rt by 


(3.78) f(a) =2¥?. 
Observe that 
(3.79) FG) =f" S644 ae Se 


Now let x be any positive irrational number whose square x? is rational. By The- 
orem [3.45] « = 2 is such a number. Consider the schematic picture in Figure 
Schematically, the blue interval represents the rationals and the red interval 
represents the irrationals. 


Exercise 3.36. Give a (formal) proof of VEtranger ’s assertion at the beginning of 


Subsection 3.8.8] 
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Figure 3.8.2. x is irrational. If c maps by f to a rational number, then we 
have that 2V? is rational. If z maps by f to an irrational number y, then we 
use that yv? is rational. 


Goal: Case 1: Case 2: 
vB 3B a) 
jay, 2 <Q V2 €Q v2 40, (PB ) =2€Q 


Figure 3.8.3. In Case 1, « = V2. In Case 2, = Jar, In both cases, 
y = V2. In both cases we also have that x and y are irrational, whereas x¥ is 


rational. 


3.9. Pythagorean triples 


Baseball fans know: 


These are better than the Pythagorean doubles, but not as good as the 
Pythagorean homers. 


And now we discuss a topic (not baseball) that has been of mathematical 
interest since antiquity. 


3.9.1. Right triangles. Suppose that we are given a right triangle, that is, a 
triangle where one of the (interior) angles is a right angle of 90°, i.e., 7/2 radians. 
Call the lengths of the two sides (called legs) incident to the right angle a and b. 


Call the remaining side, the hypotenuse\ c. The Pythagorean Theorem [3.4] then 
says that 


(3.80) a+h=c". 


Definition 3.52. If a, b, and c are all positive integers, then we say that (a, b,c) is 


a Pythagorean triple, We call a and b leg lengths and c the hypotenuse length 


of the Pythagorean triple. 


Example 3.53. Since 3? + 4? = 5”, we have that (3, 4,5) is a Pythagorean triple. 
Another Pythagorean triple is (5, 12,13). For a few more examples, see Table[3.9.1] 


If (a,b,c) is a Pythagorean triple, then clearly we have 
(3.81) 0<a,b<e. 
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Definition 3.54. We say that a Pythagorean triple (a,b,c) is primitive if a,b,c 
are coprime; that is, the only positive integer dividing each of a, b,c is 1. 


In particular, a primitive Pythagorean triple (a, b,c) cannot have a, b,c all even. 


Example 3.55. The Pythagorean triple of positive integers (3,4,5) is primitive, 
whereas the Pythagorean triple (9,12,15) is not primitive since 3 divides each of 
9,12,15. See Figure [3.9.1] 


10 


4 8 


Figure 3.9.1. Pythagorean triples. The one on the left is primitive, and the 
one on the right is not primitive. 


3.9.2. Pythagorean triples and parity. The following result gives constraints 
on the parities of integers in Pythagorean triples. 


Lemma 3.56. If (a,b,c) is a Pythagorean triple, then at least one of a and b is 
even. 


Furthermore, if (a,b,c) is primitive, then exactly one of a and b is even (and 
one is odd), and hence c is odd. 


Proof. For a contradiction, suppose that both a and 6 are odd. Then there exist 
non-negative (actually positive) integers k and @ such that a = 2k+1 and b = 2é€+1. 
We have 


(3.82) =a? +b? = (2k41)? + (204-1) =4(h? +k4+ F428) 42. 


This implies that c? is even, which in turn implies that ¢ is even. Thus there 
exists an integer m such that c = 2m. Hence c? = 4m? is a multiple of 4, which 
contradicts (3.82). We conclude that at least one of a and 0 is even. 


Now suppose that (a, b,c) is primitive. If a and b are both even, then c is even, 
which is a contradiction. Thus, in this case exactly one of a and b is even, and 
exact one of them is odd. Finally, this implies that c? is odd, which implies that c 
is odd. 


Exercise 3.37. Prove that if (a,b,c) is a Pythagorean triple, then a+b—c is even. 
Exercise 3.38. Let (a,b,c) be a primitive Pythagorean triple. Prove: 

(1) Exactly one of (a,b,c) is divisible by 3. 

(2) Exactly one of (a,b,c) is divisible by 4. 
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(3) Exactly one of (a,b,c) is divisible by 5. 
In particular, abc is divisible by 60. 
Exercise 3.39. Prove that if (a,b,c) is a Pythagorean triple, then 
(c—a)(c— b) 
2 


is a perfect square. (You may first check a few examples to give you confidence that 
this statement is true.) 


(3.83) 


3.9.3. Euclid’s formula. 


Theorem 3.57. There are an infinite number of primitive Pythagorean triples. 


This follows from: 


Lemma 3.58 (Euclid’s formula). If m and n are positive integers with m > n, 
then 


(3.84) a=m—n*?, b=2mn, c=m?4+n? 
yields a Pythagorean triple. 

Observe that 
(3.85) c-b=(m—n)* 


is a perfect square, as well as 


(3.86) =n 


being a perfect square. 


For example, for every positive integer n, by taking m =n+1 we obtain the 
Pythagorean triple 


(3.87) a=2n+1, b=2n?+2n, c= 2n?4+2n+1. 


Note that, here, a<b=c—1<ce. In particular, any odd integer at least 3 is the 
leg length of a Pythagorean triple. 


Examples of Euclid’s formula are given in Table B.9.1] 


Table 3.9.1. Some instances of Euclid’s formula for Pythagorean triples. 
Primitive triples are in red. 


mni|a be 
2 1/3 4 5 

3 1/8 6 10 
3. 2) :o a2 13 
4 1/15 8 17 
4 2/12 16 20 
4 3] 7 24 25 
5 11424 10 26 
5 221 20 29 
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Exercise 3.40. The Pythagorean triples of positive integers less than 100 after 
Table |3.9.1] are 


(12, 35,37), (9,40,41), (28,45,53), (11,60,61), (16,63,65), (33,56, 65), 
(48,55,73), (13, 84,85), (36,77,85), (39,80,89), (65,72, 97). 


For each of these Pythagorean triples, find the corresponding pair of positive integers 
(m,n) satisfyingm > n and Euclid’s formula (8.84). (You may need to switch a 
and 6 to get the form (3.84) in the same order.) 


Exercise 3.41. Prove that Theorem [3.57 follows from Lemma |3.58 


Exercise 3.42. Prove Lemma |3.58} Hint: This is just a calculation. 


17 


13 3 25 


12 15 24 


Figure 3.9.2. Visualization of three primitive Pythagorean triples drawn to 
(different) scale(s). 


3.9.4*. Primitive Pythagorean triples. Conversely, we have the following re- 
sult. 


Lemma 3.59 (Converse of Euclid’s Lemma). Jf (a,b,c) is a primitive Pythagorean 
triple with a odd and b even, then there exist positive integers m and n withm>n 
such that (a,b,c) is of the form (8.84): 


a=m?—-n?, b=2mn, c=m?+n’. 


Moreover, m and n are coprime, and one is even and one is odd. 


Proof. Denote P = (2, 2). Since ean — (2)" = 1, the point P lies on the unit 
circle in R?, and also P lies in the interior of the first quadrant of the plane since 
a, b,c are all positive. 

Let Q = (0,—1) be the ‘south pole! of the unit circle, and let R be the inter- 
section of the line segment PQ with the (positive) x-axis. We easily find that (for 
example, by similar right triangles; see Figure [3.9.3]) 


b 
= b 
(3.88) n= ( c 2-0) =( 0). 
le] at+e 
On the other hand, we can write P as an of Q and R: 


(3.89) (2.2) =P= = 1(0,-1) +¢(—2.0) = (Sot i), 


where the parameter t 4 0 is determined by the condition that P lies on the unit 
circle. So we obtain 


(3.90) 1=¢ (a+oR +(t—1)?; 
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Figure 3.9.3. Similar triangles used to derive formula (3.88) for R. 


that is, after simplifying and factoring out the non-zero variable t, 


b2 
3.91 t{| ——5,4+1])-2=0. 
ak (arg tt) -2=0 
Therefore, 

2 2 2 
(3.92) f= ee) 


Gor tl PH (atee 


By substituting this into (8.89), we obtain the identities 


(3.93) b —-.2b(a +e) a (a+c)?—b? 
ce b+(a+c)?’ ec b+(a+c)?’ 
Now let g = gcd(b,a +c), and define m := a and n := - We then obtain 
2 2 a2 
(3.94) b _ _2mn . a man 
c m+n? c m+n? 
Finally, we claim that (8.84) holds; that is, 
(3.95) a=m—-n=:4, b=2mn=:b, c=m? +n? =% 


To see this, first observe that (3.94) says that 


bb ai 
(3.96) =2, S28, 
Cc Cc Cc 
so that 
(3.97) ay ae ag, 
a bee 


We also write this as (a,b, @) = k(a,b, c). 
Clearly k € Q since it is equal to a quotient of integers. 


Claim. k is a (positive) integer. 
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Proof of the claim. Suppose for a contradiction that k is not an integer. Then 
k =", where r,s € Z* satisfy ged(r,s) = 1 and s > 1. We have that 
(3.98) @=a, b=+b, =e 

8 8 8 

are all integers. Since gcd(r,s) = 1, by Theorem [4.23] this implies that s divides 
each of a,b,c. We have a contradiction to the assumption that the Pythagorean 
triple (a,b,c) is primitive. q.e.d. the claim. 

Note that gced(m,n) = 1 by Theorem[I.50] We will show that the Pythagorean 
triple (a, b, é) is primitive, in which case (8.95) follows since k is a positive integer 
which then must be equal to 1. 


Suppose for a contradiction that the Pythagorean triple (4, b, ¢) is not primitive. 
Then the integer k in (3.97) satisfies k > 1. 

Let p be an odd prime dividing k. Then p divides each of 2mn, m? — n?, and 
m? + n?. 

Claim. p divides both m and n, a contradiction to gcd(m, n) = 1. 

Case 1: Since the prime p > 2 divides both 2m? and 2n? and since ged(p, 2) = 
1, by Theorem p divides both m? and n?. By Corollary this implies that 
p divides both m and n. q.e.d. the claim. 


By the claim, k is equal to a power of 2; that is, k = 2° for some positive integer 
t. By (8.97), we have 


(3.99) m—n? =4=2'a, 2mn=b=2b, me4+n? =e=2c. 


Since gcd(m,n) = 1, we cannot have both m and n even. Since 2 divides 
m? — n?, we have that m and n have the same parity and hence m and n are both 


odd. However, 2mn = 2°b now yields a contradiction since ¢t > 0 and b is even. 


We conclude that the Pythagorean triple (4, b, é) is primitive and hence equal 
to (a,b,c). The lemma follows. 


So, in conclusion, we have also proved Theorem by Exercise [3.41 


3.10. Quantifiers 


So far in this book we have had a smattering of existential and universal quantifiers 
in our discussions. In this section we give a more ‘conscious)’ discussion of these 
quantifiers. 


3.10.1. Existential quantifiers. 


Question: What is an existential crisis? 
Answer: When you forget the meaning of J. 


An existential quantifier is about the existence of something. That is, there is 
something of some sort. 
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3.10.1.1. Existence. (existence): 


4means “there exists”. 


We call J an existential quantifier 


For example: 


Theorem 3.60. There exists an irrational real number. 


Proof. By Theorem [B.45] V2 is an irrational real number. 


Since we are asked to prove the existence of an “animal”, in this case an ir- 
rational number, all we need to do is produce just one of this animal. So we are 
done! 


So, if it comes easy, the template for the proof of the existential statement: 
x € X, P(x) is 


Jean € X and P(Jean) is true. 


3.10.1.2. Non-existence. But, in general and according to Ringo Starr, ‘Jt 


Don’t Come Easy”. Consider the statement (non-existence): 
‘There is no free lunch. 


Mathematically, as long as a “free lunch” is a well-defined term, this says: 


There does not exist a free lunch. Le., 4 free lunch. 
3.10.1.3. Proving existence and non-existence. The following comes easy: 


Solved Problem 3.61. Prove that there exists a rational number x satisfying 
z>landa<2. 


Solution. Let «= 3. Then z is a rational number, x > 1, and x < 2. 


And it can be easy when it don’t come easy: 


Solved Problem 3.62. Prove that there does not exist a real number x satisfying 
x<landa> 2. 


Solution. Suppose for a contradiction that there does exist a real number x 
satisfying x < 1 and x > 2; i.e., 2 <a. We then have that 2 < 1 by the transitivity 
of <. This is a contradiction. 


Exercise 3.43. Prove that there exists x € R satisfying both x >a and x < b if 
and only if a <b. 


Exercise 3.44. Prove that for every positive integer n, there exists a prime p such 
that p > n. 


Exercise 3.45. Prove that for every positive rational number r, there exists a 
positive rational number s satisfying s < r/100. 
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3.10.2. Universal quantifiers. A universal quantifier is about everything of 
some sort having some property. Namely, ubiquity. Not much ado about everything: 


V means “for all”. 


We call V a universal quantifier 


For example, consider the statement: “For all x € R, 2? > 0.” This means that 
the square of every real number is non-negative. 


How not to prove a universal statement: 
Joan € X and P(Joan) is true 


. unless: Joan is the only element of X! 


On the other hand, X may consist of ‘Jean and Joan and a who knows who’, 
according to Carlos Santana. In particular, if X has an infinite number of elements, 
then you would have to check that P(x) for all a in X, which is an infinite number 
of x’s. 

For example, we cannot just say that 7 € R and 1? > 0; therefore, Vz € R 
x? > 0. But we can argue: Let x be (any) real number. Blah, blah, blah|§ 
Therefore x? > 0. 


2 


Solved Problem 3.63. Prove that for every rational number r, r* is a rational 


number. 


Solution. Let r be any rational number. Then there exist integers a and b 
2 
such that r = $ (so b #0). We then compute that r? = $. Since a? and b? are 
integers (and 6? 4 0), we conclude that r? is a (well-defined) rational number. 0 


Exercise 3.46. Prove that for every rational number r and s, r+ s and rs are 


rational numbers. 


3.10.3. Disproving existential statements. Suppose that we are asked to dis- 
prove that there exists a unicorn on Earth. Assuming that any unicorn on Earth, 
if it exists, is an animal, this is equivalent to proving the following universal state- 
ment: For every animal on Earth, the animal is not a unicorn. 


More abstractly, consider the existential statement: 
Azo E Xx, P(a09). 
To disprove this statement, we need to prove the following universal statement: 


Va € X, not P(a). 


In other words, this last statement is equivalent to the negation of the statement 
Jay € X, P(xo), which we may write as fx € X, P(x). 


To disprove an existence statement, i.e., to prove a non-existence statement, 
one method is proof by contradiction, as in Solved Problem B.62] 


To summarize, the negation of an existential statement is equivalent to a univer- 
sal statement. For example, if we flippantly think of a solution x to the inequalities 
2<a <1 as being evil, then we are saying that the statement “there is no evil” is 


5 We leave it to you, dear reader, to recall the details of the proof, where the idea is to consider 
the two cases of squares of (i) non-negatives and (ii) negatives. 
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equivalent to the statement “it’s all good (from Diego to the Bay)”, assuming that 
evil and good are negations of each other! On the other hand, if 
then not all is good! In general, by taking negations in the first statement of this 
paragraph, we see that an existential statement is equivalent to the negation of a 
universal statement. 


3.10.4. Disproving universal statements. Now suppose we are asked to dis- 
prove the following statement: For every fish on Earth, the fish can swim. Dis- 
proving this universal statement is equivalent to proving the following existential 
statement: There exists a fish on Earth that cannot swim. 


More abstractly, consider the statement 


Wye Y, Q(y). 


To disprove this statement, all we need to do is provide an element Shaq € Y such 
that Q(Shaq) is false. This is because the negation of the universal statement above 
is equivalent to the existential statement 


yo € Y, not Q(yo). 


3.10.5. Examples of earlier statements as universal or existential state- 
ments. In this subsection we rephrase statements of the results we have encountered 
in Chapter [I] more formally in terms of existential and universal quantifiers, and 
implications. 


Recall that E is the set of even integers and O is the set of odd integers. 


Theorem says: Va,b € Z, a,b € E> a+b e€E. In other words, the sum 
of any two even integers is even. 


Theorem [1.27] says: Va,b € Z,a,be OS a+beEE 
Corollary [1.30] says: 


(1) Va € Z,a € E> a? €E. In other words, the square of any even integer is 
even. 
(2) Va€ZacOsadecO. 
Theorem [1.52] says: Va,b € Z, 
(3.100) claandclb => Ym,n€Z, cl(ma+nb). 


In other words, for any integers a, b,c, if c divides both a and b, then c divides any 
integral linear combination of a and b. 


Exercise [1.23] says: 
(1) Va, b,c € Z, alb => albc. 
(2) Va,b € Z, alb => alb?. 


3.10.6. More mathematical implications: Implications with free variables. 
Regarding “more”, the title of this subsection is punny (but not puny!). We look 
at some implications with a|free variable) These are implications of which we have 
already considered many examples. For example, the implication with the free 
variable « € R: For all x € R, if x 40, then 2? > 0. 
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Consider the general form of an implication in one free variable: 
(3.101) Va € X, P(x) > Q(a). 
In terms of sets, we can think of this statement as saying the following. Let 
B:= {x € X : P(x) is true}, 
Q:= {x € X : Q(z) is true} 
be the truth sets of P and Q, respectively. That is, 8 and Q are the sets of all x 
where P(x) and Q(x), respectively, are true. 
We have that is equivalent to the set containment of truth sets (see 
Figure 3.10.1): 
(3.102) Bc. 
Indeed, (8-10T) says that if P(x) is true, then Q(z) is true. On the other hand, 


(8.102) says that if e € 8%, then x € Q. By the definition of the sets 8 and Q, this 
says that if P(x) is true, then Q(«) is true. This proves the aforestated equivalence. 


As a device to remember this equivalence better, let us define for the moment 
that x is a P-true element if P(x) is true and z is a Q-true element if Q(a) is true. 
So $8 and Q are the sets of P-true and Q-true elements, respectively. So, in this 
language, says that any P-true element is a Q-true element. 


x 


Figure 3.10.1. Visualizing an implication: The rare occurrence of a duck 8 
in a pond Q surrounded by sand X. 


Consider, as an example, the following implication: 
Let x € R. If x > 0, then x? > 0. 


This is a true statement and it illustrates all three “true” rows of the four rows 
in the truth table for >. Let P(x) = (x > 0) and let Q(x) = (x? > 0). Let us 
consider the three cases: 

(1) « >0. Then P(z) is true and Q(z) is true. 

(2) « =0. Then P(z) is false and Q(z) is false. 
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(3) « <0. Then P(z) is false and Q(z) is true. 


In all three cases, the statement P(x) => Q(z) is true. Since, for every real number 
x we have the trichotomy x > 0, x = 0, or x < 0, we conclude that the statement 
that P(x) > Q(x) for all x € R is true. 

The example above was given to illustrate a point about implications. You may 
have noticed that we earlier proved the following stronger statement: 


Let x ER. If e £0, then x? > 0. 


y y 
oO ® 
fo. oO 
@ 
o ® 
@ > Xx > x 
Va, dy Ir, Ay 
Figure 3.10.2. Visualizing universal and existential quantifiers of two vari- 


ables. The right and left grids visualize parts (1) and (2) of Example 3.64 
respectively. 


3.10.7. Mixing universal and existential quantifiers of two variables. Let us 
start out very simply. 


Example 3.64 (Fun with Dick (existential) and Jane (universal) ) 


(1) de € R, Sy € R, « > y. Here, we just have to find two real numbers where 
one is greater than the other. So, randomly, we let x = 1 and y = 0. Since 
1 > 0 and hence x > y, we have proved the statement. 


(2) Va € R, dy € R, x > y. Here we have to show that no matter what z is, 
there is some y greater than x. So we: Let « € R. (Given this x,) we then 
let y = x —1, say. Since « > x —q and hence xz > y, we have proved the 
statement. 

x é€R,Vy eR, a> y. This statement is false. For: Let « € R. Then let 

=a. Then x # y. This proves that for all x € R, there exists y € R such 

hat (not x > y). 


(3) 


ae lw 


(4) 


zc e€R,VyeR,x> y. We leave it as an exercise to prove that this statement 
is false! 


Exercise 3.47. Prove each of the following statements. 


(1) de ER, Sy ER, x? = -y?. 
(2) Vz ER, Jy ER, x? < y— 100. 
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3) dz ER, Vy ER, « < y* — 100. 
4) Ve ER, VyE R, 2? > -y?. 


Exercise 3.48. Disprove each of the following statements. 


1) Se ER, Jy ER, a? =-y? - 1. 
) Va ER, dye R, x > y? — 100. 
) de ER, Wy ER, 22 <y+100. 
) V2 ER, Vy ER, 2? > -y?. 


w 


y 


Solved Problem 3.65 (Bananas| split). Prove the following statement: 
(3.103) VWyeR, (AteR, 2? <y) or (Ax €R, 0<2<-y). 


Solution. Let y € R. 


Case 1: y > 0. In this case, since 0? < y, the statement (Ar € R, x? < y) is 
true since we may just take x = 0. 


Case 2: y < 0. In this case, we observe that 0 < —$ < —y. So the statement 


da €R, 0 < a2 < —y) is true by taking z = —$. 


— 
| 


Since for every real number y we have y > 0 or y < 0, the statement is proved. 


Post-Mortem Analysis. Denote by P(y) the substatement (Jz € R, 2? < y), 
and denote by Q(y) the substatement (dz € R, 0 < x < —y). Then we see that 
statement (3.103) takes the general form 


(3.104) Vy € R, P(y) or Q(y). 


In our solution above, we showed that: 
(1) If y > 0, then P(y) is true. 
(2) If y < 0, then Q(y) is true. 

In fact, we have the stronger facts 


(3.105) Ply) Sy>0 and Qy)Sy<0. 


Exercise 3.49. (1) Prove (3.105). 


(2) Explain why to prove a statement of the general form (8.104), it suffices to 
show that union of the set of y for which P(y) is true and the set of y for 
which Q(y) is true is the set of all real numbers. 


Exercise 3.50. Prove the following statement: 


(3.106) VeeER, (VyER, -y?<a2-2) or (AYER, 2? <y<8). 


3.11. Hints and partial solutions for the exercises 


Hint for Exercise All three statements are false. Use the identities in the 
hint to help find in each case real numbers x and y where the stated identity is 
false. 
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Hint for Exercise We remark that a calculator often does not give exact 
answers (never if the number is irrational). So we do not recommend using a 
calculator. 


(1) is false, whereas (2) and (3) are true. 
Hint for Exercise Consider the special case where x = y = 1. 


Hint for Exercise Take for example the mistaken formula (8.12). Let 
f(x) = Va. Then (3.12) says that f(a +y) = f(x) + f(y), which would be true for 
a linear function but isn’t true for this function. 


Hint for Exercise The area of our right triangle is equal to sab. The same 
is true for every rotation of our triangle. 


Hint for Exercise Regarding the symmetry in left and right, the point is 
that there are two locations for the glasses, one of which we can call “L” and one of 
which we can call “R”. The structure of the argument doesn’t change if we switch 
L with R. 


Hint for Exercise [3.7 Details for Subsubcase 1b(ii): By (8.24), either Bg or Bio 
is the funny bunny. Suppose for a contradiction that Bio is the funny bunny. Then 
By and By; are regular bunnies, and thus (8.24) contradicts (8.22). We conclude 
that Bg is the funny bunny, and it weighs more than the regular bunnies. 


Hint for Exercise [3.8] In Case 1 we are done since Q implies Q or R. In Case 2 
we are done since R implies Q or R. 
Hint for Exercise Separate the proof into two cases: (1) a= b, (2)a< b. 
Hint for Exercise Use the hint and multiply the inequality by +. 
Hint for Exercise Let a and b be non-negative real numbers such that 
a+b=0. Then we have 
(3.107) 0<a=-b<0. 
Continue. 
Hint for Exercise [3.12] If x <0, thenx-2>0-z. 
Hint for Exercise One way to solve the exercise is by truth tables. 
Will illustrate another way for part (2). Let P and Q be statements. 


Case 1. Suppose P is true. Then (P or Q) is true, which implies that (P or 
Q) or ((not P) or Q) is true. 


Case 2. Suppose P is false. Then ((not P) or Q) is true, which implies that 
(P or Q) or ((not P) or Q) is true. 


Since Case 1 or Case 2 is true and in each case we have proved the desired 
conclusion, we are done. 
Hint for Exercise We prove part (2). 

Case 1. Suppose P is true. Then ((not P) and R) is false. Hence the statement 
(P and Q) and ((not P) and R) is false. 


Case 2. Suppose P is false. Then (P and Q) is false. Hence the statement 
(P and Q) and ((not P) and R) is false. 
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Since Case 1 or Case 2 is true and in each case we have shown that the statement 
(P and Q) and ((not P) and R) is false, we are done. 


Hint for Exercise No, it is not a contradiction by the hint. 


Hint for Exercise Proof of (3). Suppose (P and Q). Then P is true. This 
implies that (P or Q) is true. We conclude (P and Q) => (P or Q). 


Hint for Exercise The Venn diagram proof of part (1) is given by Figure 
[3.6.1] 


Hint for Exercise Suppose 2? > 0. Suppose for a contradiction that x = 0. 
Continue. 


Hint for Exercise We prove this by cases. Let a and b be real numbers 
satisfying (a > 0 and b > 0) or (a < 0 and b < 0). Let Case 1 be (a > Oand b > 0). 
Let Case 2 be (a < 0 and b < 0). In each of these cases, use elementary properties 
of inequalities and multiplication to obtain the desired conclusion. 


Hint for Exercise Suppose ab > 0. Firstly, show that a 4 0. Secondly, 
consider two cases: (1) a > 0, (2) a < 0. In case (1) prove that b > 0, and in case 
(2) prove that b < 0. 


Hint for Exercise Suppose P > Q and Q => R are both true. Let P. 
Then since P > Q, we have Q. Continue. 


Hint for Exercise (1) Case 1: P is true. Then P or (not P) is true (since 
P or R is true for every statement R). 
Case 2: P is false. Then (not P) is true, so P or (not P) is true (since R or 
(not P) is true for every statement R). 
Here is the truth table method of the proof: 
P | not P | P or (not P) 
(3.108) T| F | ih 
Tr) © | ay 


(2) This result is saying that an implication is equivalent to its contrapositive. 


(=) Suppose P => Q. Then suppose not Q. Now suppose P. Since P > Q, 
we have Q, which is a contradiction (to not Q). Therefore, P is false; i.e., not P is 
true. We have proved that not Q > not P. 


(<) Suppose: not Q => not P. Then suppose P. Now suppose not Q. Since 
not Q => not P, we have that not P is true, i.e., P is false, which is a contradiction. 
Therefore not Q is false; i.e., Q is true. We have proved P > Q. 


Hint for Exercise The real number x has the property that 
not (« >aand x < b) 

is true. 

Hint for Exercise Suppose for a contradiction that x = 0. 


Hint for Exercise Corollary [3.38(1): The contrapositive is: If a is odd, 
then a? is odd. This is the second statement of Corollary [1.30] 
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Corollary 2): The contrapositive is: If b is even, then b? is even. This is 
the first statement of Corollary 


Hint for Exercise [3.26) Prove the contrapositive using Theorem [1.25 
Hint for Exercise [3.27] Use Theorem [1.28] 


Hint for Exercise 2 is a prime. Now suppose that there exists an even 
prime p besides 2. Derive a contradiction. 


Hint for Exercise Suppose that a positive integer a is a divisor of a positive 
integer b. Then there exists an integer k such that 


(3.109) b = ka. 
Explain why & > 1 and why this yields a contradiction if we assume that a > b > 0. 
Hint for Exercise The contrapositive of this implication is Theorem [1.52] 


Hint for Exercise Suppose that d divides both a and b but d does not 
divide c. Suppose that m and n are a solution to the equation 


(3.110) am-+bn=c. 
Derive a contradiction. 
Hint for Exercise The gcd must be equal to p. 


Hint for Exercise If n is a prime, then ged(n, (n — 1)!) = 1. This is the 
contrapositive of the statement we want to prove. 


Hint for Exercise [3.34] For a contradiction, suppose that ¥/2 is a rational 
number. Then there exist integers a and b such that /2 = ¢. By Theorem [I-50) we 


may assume that ged(a,b) = 1. Now 2 = ae so that 2b? = a®. Continue. 


Hint for Exercise For a contradiction, suppose that ,/p is a rational 
number. Then there exist integers a and b such that,/p = $. Use Theorem [1.50] 
and Lemma|3.48 


~V/2 
Hint for Exercise Let x = or. If x is rational, then we are done. So 
we may assume that zx is irrational. Consider «V2. Does this provide an example 
of l’Etranger’s assertion? 


Hint for Exercise Let (a,b,c) be a Pythagorean triple. By Lemma 
at least one of a and b is even. Consider the two cases: (1) both a and 6 are even 
and (2) exactly one of a and b is even. 


Hint for Exercise Proof of part (1): The remainder of a perfect square 
after dividing by 3 is either 0 or 1. If c? is divisible by 3, this implies that both 
a” and b? are divisible by 3. This in turn implies that each of a,b,c is divisible by 
3, which contradicts the primitiveness assumption. Thus c? has remainder 1 after 
dividing by 3. This implies that one of a? and b? has remainder 0 after dividing 
by 3 and the other has remainder 1 after dividing by 3. Of the two integers a and 
b, the one whose square is divisible by 3 is the one which is divisible by 3, and the 
other isn’t. 
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Hint for Exercise [3.39] Firstly, assume that (a, b,c) is primitive, and use Lemma 
Secondly, reduce the general case to the primitive case. E.g., if (a,b,c) is not 
primitive, then there exists an integer k > 1 such that 


(3.111) oe 
ki kik 
is a primitive Pythagorean triple. 
Hint for Exercise (6,1) gives (35, 12,37), and (5,4) gives (9, 40, 41). 


Hint for Exercise [3.41] By Lemma[3.58] any odd integer at least 3 is the leg 
length of some Pythagorean triple. In particular, any odd prime number is the leg 
length of some Pythagorean triple. Continue. 


Hint for Exercise Let m and n be positive integers with m > n, and let 
(3.112) a=m—n?, b=2mn, c=m? +n’. 

Show that a? + b? = c?. 

Hint for Exercise [3.43] (=): Use the transitivity of inequality. (<=): Consider 
the average of a and b. 

Hint for Exercise There are an infinite number of primes. 

Hint for Exercise [3.45} Divide r by an integer greater than 100. 


Hint for Exercise Since r and s are rational, there exist integers a, b,c, d 
such that r = ¢ and s = §. Calculate. 
Hint for Exercise [3.47] (1) Let « =0. What should we let y equal? 

(2) Given wz, define y appropriately in terms of x. 
Hint for Exercise [3.48] (3) Prove that Vz € R, dy € R, 2? > y+ 100. 

(4) Let « = 0. What should we let y equal? 
Hint for Exercise [3.49} For the first biconditional: Use that if « € R, then 
x? >0. 

For one of the directions of the second biconditional: Let « = a 


Chapter 4 


The Euclidean Algorithm 
and Its Consequences 


277 55 +2 = 277=5-5542 


Figure 4.0.1. A numismatic way to think of this is that $2.77 equals 55 


nickels plus two pennies for your thoughts (inflation)! 


Goals of this chapter: Now we start to get serious about elementary number 
theory. To show that being serious does not mean being difficult ©. To exhibit the 
power of the Division Theorem. To understand the greatest common divisor, a pow- 
erful concept. To solve linear Diophantine equations. To prove the Fundamental 
Theorem of Arithmetic. 


4.1. The Division Theorem 


We have seen the statement of the Division Theorem in §1.4.3] Now we return 
to this fundamental result and discuss its proof as well as a number of important 
applications to arithmetic. 


4.1.1. Division in everyday life. When we divide the integer 277 by 5 to obtain 


a{remainder of 2, we really mean that there exists an integer q (called the quotient) 
such that 


(4.1) 277 = 5q +2; 


namely g = 55. The significance of 55 is that this is the unique integer that produces 
a remainder r with 0 < r < 5; namely r = 2. 


If for example we instead divide —277 by 5, then to get a remainder r with 
0<r <5, we would write 
—277 = 5(—56) + 3; 


namely in this case the quotient is g = —56 and the remainder is r = 3. 
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Now, given any integer a, such as a = 277 or a = —277, and given any integer 
q, for dividing a by 5 with quotient g, we may define the remainder r by 
(4.2) a=5q+r. 


For the Division Theorem, we are looking for the unique integer q that produces 
the smallest positive remainder r. 


Here’s another, more practical, example: The Superbowl, which is on a Sunday, 
is 50 days from now. What day of the week is it today? Without batting an eyelash, 
we announce that today is Saturday! How did we know this? 
Namely, as an example of modular arithmetic, since there are seven days in a week, 
we did the calculation 50 = 7-7+ 1 and then used this to subtract 1 day from 
Sunday. 


Figure 4.1.1. Counting backwards from the day of the Superbowl to today! 


Now suppose that we are closer to the big day. The Superbowl, which is at 3 
pm Pacific Time, is 100 hours from now. What time of day is it? Again, we use 
modular arithmetic and calculate that 100 = 4-24+ 4. Since there are 24 hours in 
a day, we conclude that it is 11 am Pacific Time. Time to get chips and salsa! 


| t—> hours 


Figure 4.1.2. Using division to 
know?) 3 pm is 15:00 hours in the 96 = 4- 24 hours 
before then is still 15:00 hours. Since 100 = 96 +4, the current time is 
15:00 — 4:00 = 11:00 hours, i.e., 11 am. 


4.1.2. Statement of the Division Theorem. In general, we may uniquely di- 
vide any integer a by any positive integer m to obtain a remainder r with 0 < r < m. 


The says: 


Theorem 4.1. Let a be an integer and let m be a positive integer. Then there are 
unique integers q and r such that 


(4.3) a=mq+r and 0<r<m. 


So, for example, when we write 
277 =5q+r, withO<r<65, 
the only possibility is that g = 55 and r = 2. 
Similarly, there is only one solution (q,1) to 
1234567890123456789 = 5q¢+r, withO<r<65. 
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It is rather easy to see that r = 4 (thanks to our base 10 number system), even 


though we cannot see quite as quickly what q is (but we know that q exists and it 
is unique). 


4.1.3. Even and odd integers revisited. Before we proceed to prove the Di- 
vision Theorem, we consider the simplest non-trivial special case. In particular, if 
we take m = 2 in the Division Theorem, then we obtain the statement: 


Theorem 4.2. Let a be an integer. Then there are unique integers q andr such 
that 


(4.4) a=2q+r and O0<r<2; that is,r=0 orr=1. 


Observe that if a is even, then r must equal 0, while if a is odd, then r must 
equal 1. We leave the proofs of these two facts as easy exercises. Consequently, 


(4.5a) aiseven <= a= 2k for some k € Z, 
(4.5b) aisodd = a=2k+1 for some k ce Z. 
Solved Problem 4.3. Let n andn+1 be consecutive integers. Prove that one of 


them is divisible by 2 (even) and one of them has remainder 1 when divided by 2 


(odd). 


Solution. Case 1: n is even. Then n is divisible by 2, and so n = 2k for some 
integer k. Hence, n+1 = 2k+1, so that n+1 has remainder 1 when divided by 2. 
Case 2: n is odd. Then, by Theorem [4.2] n has remainder 1 when divided by 
2, and so n = 2k +1 for some integer k. Hence, n+ 1 = 2k +2 = 2(k +1), so that 
n+ 1 is divisible by 2. 


Exercise 4.1. Letn,n+1,n+2 be consecutive integers. Prove that exactly one of 
these 3 integers is divisible by 3. 


4.1.4. The well-ordering principle. We will need the following axiom for the 
proof of the Division Theorem. 


Axiom 4.4. The well-ordering principle is the statement: 
Every non-empty set of positive integers has a least element. 
In other words: 


If X is a non-empty subset of Z*, then X has a minimum element. 


We give a proof of the following in 94.11] (the appendix to this chapter) below. 


Theorem 4.5. The Aziom [2.5] of mathematical induction and the well-ordering 
principle are equivalent. That is, each implies the other. 


We can slightly generalize the well-ordering principle by the following. 


Exercise 4.2. Let b be an integer. Let Y be a non-empty subset of Z such that 
y= b for ally © Y. Prove that Y has a minimum element. Hint: Define 


(4.6) X:={y-—b4+1:yeyY}. 
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4.1.5. Proof of the Division Theorem. In this subsection we prove the Divi- 
sion Theorem [4.1] If you prefer to see the proof in a more concrete setting, first see 
the example after the proof. 


Step 1. Proof in the case where a > 0. Note that if a = mq+yr, then 
r=a-—mgq. 


Existence (when a > 0). Consider the set S of all non-negative remain- 
ders; that is, define 


(4.7) S={a—mg| Ge Z}NZ-. 
The remainder we seek is the smallest one: 
(1) Since a— m-0=a> 0, we have a € S, so that S is non-empty. 


(2) Since S Cc Z* and S 4 @ (the empty set), by the well-ordering principle 
(Exercise [4.2] with b = 0), there exists a smallest element of $, which we call r. 


Since r > 0 and since by the definition of r € S' there exists q € Z such that 
r=a-—-mq, ie, a=maq+r, 


we just need to show that r < m in order to prove the existence part of the Division 
Theorem. 


Observe that r — m is a remainder since 
r—m=(a-—mq)—-m=a-—m(q+1) 


and q+1€ Z. On the other hand, since r— m < r and since r is the smallest 
element of S, we must have r—m ¢ S. By the definition of S, this implies r—m < 0; 
Le, r< ml] This completes the proof of the existence of g and r satisfying (43). 


Uniqueness. (Here we do not use the condition that a > 0.) Suppose 
that q,r and q,7 are both pairs of integers such that we have both 


a=mqtr and 0<r<m 
and 
a=mq+r and 0<r<m. 
Then mq +r = mq+7 and hence 
F#—r=m(q—-q. 
Observe that since 7 < m and r > 0, we have that r -—r <m—O=™m. Similarly, 
since 7 > 0 and r < m, we have that r —r >0-—m=-—m. Thus 


—m<r-r<m, 
and equivalently, 
—m<m(q-qg<m; ie, -l<q-q<l. 
Since q—q is an integer and since it is strictly between —1 and 1, we conclude that 
q-q=9, 


1To wit, r—_m is a remainder smaller than r (the smallest non-negative remainder), so it must be 
negative. 
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which in turn by * — r = m(q— @) implies that 


r—-r=0. 


Hence q= q andr =7r. This completes the proof of uniqueness. 


Step 2. (Existence when a < 0.) It only remains to prove existence in the 
case where a < 0. Again we let 


S={a—mg@| qe Z}nZ 
be the set of non-negative remainders. 


Observe that a(1—m) € S, so that S is non-empty. Indeed, we have that 
a(1—m) =a—ma=a-—m4, where Gg = a, and that a(1—m) > 0 since a < 0 and 
1—m <0 (from m > 0). This says that a(1 — m) is a non-negative remainder and 
hence is in S. 


This is the only place in the proof where we previously used the condition that 
a> 0. Thus the rest of the proof of existence is exactly the same and we are done 
with the proof of the Division Theorem. 


Example 4.6. Consider the division in (41), where the solution is given by 


(4.8) 277 =5-5542. 
We can obtain the quotient g = 55 and remainder r = 2 as follows. Let 
(4.9) S = {277-5¢:G€ Z}NZ, 


where Z= denotes the set of non-negative integers. By inspection, we see that (see 
also Figure |4.1.3) 


S = {2,7, 12,17, 22,27,...} ={r+5p:peZ7}. 


The remainder r = 2 is the minimum element of S. One then recovers the quotient 
q by 2 = 277 — 5q; that is, 
_ 277-2 
aie: 
To reprove in this more concrete case that these are the unique remainder and 
quotient, we suppose that qg and r satisfy 


= 55. 


5q+r=5-55+2, where 0O<r<5. 
By elementary algebra, we have 
r—2=5(55 —4q). 
Since 0 < r < 4, this implies that 
—2 < 5(55 — q) < 2; 
that is, 


2 2 
1< 5 oO fae h 
Since q is an integer, we conclude that 55 — q = 0; that is, g = 55. From this we 
then easily recover that r must be equal to 2. 


The reader may wish to compare this example with the general proof of the 
Division Theorem. 
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< 


0 2 7 12 17 22 272 277 = =282 


Figure 4.1.3. The Division Theorem illustrated: The set S of non-negative 
remainders in consists of the red point and the (infinitely many) blue 
points, where the red point is the minimum element of S and is the smallest 
non-negative remainder. 
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Figure 4.1.4. The general version of Figure[4.1.3] The smallest remainder r 
by definition satisfies r > 0. Since r is the smallest, r — m < 0, which implies 
r<m. By virtue of r € S, we have r = a — mq for some integer g. That is, 
a=mqt+r, where0<r<m. 


Exercise 4.3. Remarkably, the integer n2 +n+ 41 is prime for 0 <n < 39. Prove 
that it is not a prime for both n = 40 and n = 41. (So the highest level we can 
reach is level 89. We proved that levels 40 and 41 are unattainable, not to mention 


Level 42!) 
Does there exist a positive integer k such that n2-+n-+k is prime for alln € Z2 ? 


The following generalizes Exercise 


Exercise 4.4. Let s be an integer and let m be a positive integer. Show that exactly 
one of the m integers s+1,5+2,54+3,...,5+m is divisible by m. Hint: Let r be 
the remainder of s+ m. Note that 1<m—r<m. Consider s+(m-—r). 


Exercise 4.5. Let a be an integer. Prove that there exist unique integers q' and r’ 
satisfying 


(4.10) a=5q tr and 15<9r’ <19. 
Hint: Use the Division Theorem. 


4.1.6. The remainder and quotient functions. The Division Theorem in fact 
defines two functions. Let m be a positive integer. Define 


(4.11) Rm = {0,1,2,...,m-— 1}. 

Another way to describe this set is 

(4.12) Rm = {ae Z:0<a< mb}. 

This is the same as the set of remainders. Define the remainder function 
(4.13) r:Z—> Rm 

by 

(4.14) r(a) =7, 


where r is the unique remainder given by the Division Theorem. Define the quo- 
tient function 


(4.15) q:Z3Z 
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by 

(4.16) q(a) = 4, 

where q is the unique quotient given by the Division Theorem. So we have 
(4.17) a=maq(a)+r(a), 0<r(a)<™m. 


Example 4.7. For m = 5 we have for example that 
2=r(2) =r(-3) =r(7) =r(—-8) = r(12) = r(-13) = r(17) = r(—18) = ---, 
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Figure 4.1.5. Visualization of the remainder map r: Z + Re (m = 6). 
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Figure 4.1.6. Alternatively, we can picture the remainder map r: Z > Re 
by placing the inputs below the dots and the corresponding outputs above the 
dots. Note that there is a one-to-one correspondence between the outputs and 
the colors. 


Solved Problem 4.8. Show that 
(4.18) r(a)=0 if and only if m divides a. 


Solution. Suppose that r(a) = 0. Then a = mq(a)+r(a) = mq(a). Thus, 
m divides a. On the other hand, suppose that m divides a. Then there exists 
an integer q such that a = mq. Thus, the remainder for this q is r = 0. By the 
uniqueness of the remainder, r(a) = 0. 


Exercise 4.6. Let a and b be integers such that r(a) = r(b) and q(a) = q(b). Prove 
that a = b. 


Exercise 4.7. Let n be an even integer. Prove that n orn+ 2 is divisible by 4. 


4.1.7. Division and squares. What can we say about the remainders of per- 
fect squares? After all, perfect squares aren’t just any numbers. They’re perfect 


(but not perfect numbers) and they’re square numbers (but not |square)! In this 


subsection we consider this question with respect to division by m, where m is a 
small positive integer. Besides squares, we also consider sums of two squares. 


Firstly, we recall the following from Corollary 3.39} Let a be an integer. Then 
(4.19) 2 divides a if and only if 2 divides a”. 
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Theorem 4.9. Let a be an integer. Then 
(4.20) 3 dividesa = 3 divides a”. 


Proof. (=) This is a special case of Exercise [1.23] (note that a divides a”). 


(<) We prove the contrapositive! Namely, we prove that if 3 does not divide 
a, then 3 does not divide a?. 


3qt+1 


3q 


3 3q 3q+1 


Figure 4.1.7. Visualizing that (3q+ 1)? has remainder 1 when divided by 3. 
Can you visualize the remainder of (3q + 2)?? 


Suppose 3 does not divide a. Now, when we divide a by 3, by the Division 
Theorem we obtain a unique remainder r that is equal to 0, 1, or 2. Since 3 does 
not divide a, r #0. Thus we have two cases: 


Case 1: r =1. In this case there exists an integer q such that a = 3g +1. We 
then compute that 


a? = (3g + 1)? = 9q? + 6g + 1 = 3(3q + 2g) +1, 
which by (4.18) shows that 3 does not divide a? (since our remainder is equal to 
i), 
Case 2: r = 2. In this case there exists an integer gq such that a = 3qg+2. We 
then compute that 


a” = (8q+ 2)? = 9q7 + 12g +4 = 3(3q? + 4g +1) +1, 


which shows that 3 does not divide a? (since again our remainder is equal to 1). 


This proves the desired contrapositive statement. 


Exercise 4.8. Prove that ifn is the square of an integer, then n = 3q orn = 3q+1 
for some q € Z. In particular, if m is an integer of the form m = 3q +2 for some 
q € Z, then m is not the square of an integer. So, for example, we know immediately 
that 999632 is not a square, neither is 1234567892. 


When we consider arithmetic from the point of view of dividing by a given 
positive integer m, we say that we are doing arithmetic modulo m. 
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Solved Problem 4.10 (Sums of two squares modulo 4). (1) Let m and n be 
integers such that 4 divides m? + n?. Prove that both m and n are even. 


(2) Let x and y be integers such that 4 divides x? + y? +2. Prove that both x 
and y are odd. 


Solution. (1) If m is even, then m = 2k for some integer k. In this case, 
m? = 4k? is divisible by 4. Since 4 divides m? + n? by hypothesis, we have that 4 
divides m? + n? — m? = n?. Thus, n is even. 

Now, suppose that m is odd. Then m = 2¢+1 for some integer ¢. This implies 
that m? = 4(@? + 0) +1. Thus 4 divides m? — 1. Since 4 divides m? + n? by 
hypothesis, we have that 4 divides m? + n? — (m? — 1) =n? +1. Hence n? is odd, 
which implies that n is odd. This implies that 4 divides n? — 1. However, this 
contradicts that 4 divides n?+1. We conclude that m is not odd. By the preceding 
paragraph, this implies that both m and n are even. 


(2) If both 2 and y are even, then 4 divides x? + y?, which contradicts our 
hypothesis. Thus, at least one of x and y is odd. Without loss of generality, we may 
assume that x is odd. By the proof of part (1) we have that 4 divides x? — 1. Since 
4 divides x? + y? +2 by hypothesis, we obtain that 4 divides x? + y? +2—(a?—-1) = 
y? +3. Thus y? is odd, which implies that y is odd. 

Remark. For a sum of perfect squares a? + b?, we have that after dividing by 
4 the remainder is exactly one of the following: 


(1) 0, in which case both a and 6 are even, 
(2) 1, in which case for a and 6 one is even and one is odd, 
(3) 2, in which case both a and 0 are odd. 


It is not possible for the remainder to be equal to 3. We leave it as an exercise for 
the reader to verify this! 


Exercise 4.9. Prove that ifm and n are integers such that 3 divides m? +n?, then 
3 divides both m and n. 


Exercise 4.10. Prove that ifn is the square of an integer, then n = 4q orn = 4q+1 
for some q € Z. 


Exercise 4.11. Prove that ifn is the square of an integer, thenn = 5q,n = 5q+1, 
orn =5q+4 for some gq € Z. 


The remainders of perfect squares are called quadratic residues. We will return 
to this topic in a later chapter. 


Solved Problem 4.11 (Quadratic residues modulo 7). Let n be a perfect square. 
Prove that there exists an integer q such thatn = 7q+1r, where r is equal to 0, 1, 
2, or A. 


Solution. Since n is a perfect square, there exists an integer k such that 
n= k?. By the Division Theorem, there exist (unique) integers p and s such that 


(4.21) k=T7p+s, where 0<s <7. 
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We compute that 
n=k*? =(7p+s)* 
= 7(7p* + 2ps) + 8”. 
So, if s? = 7m-+r, where 0 < r < 7, then we have that 
(4.22) n=T7q+r, where q = 7p* +2ps+m. 


That is, the remainder of n = k? after dividing by 7 is equal to the remainder of 
s’ after dividing by 7. Since 0 < s < 7, we only need to consider the remainders of 
the squares of the first 7 non-negative integers. They are 


r(0")=0, 

r(1?) = r(6”) = 1, 
r(2?) =x(5°) = 4, 
r(3”) = r(4”) =2 


Here, we used that r(s?) = r((7 — s)?). 


Exercise 4.12. Regarding perfect squares and remainders, we can summarize what 
we have seen so far in the following table: 


Table 4.1.1. Remainders of perfect squares after dividing by m for 2<m< 7. 


m | remainders of perfect squares after dividing by m 
2 0,1 

3 0,1 

4 0,1 

5 0,1,4 

vA 0,1,2,4 


Complete Table [4.1.1] form < 10 by considering the cases m = 6,8,9,10. 
Exercise 4.13. From the earlier exercises we conclude that n is not a perfect square 
if any of the following conditions hold: 

(1) dividing n by 3 yields a remainder of 2, 

(2) dividing n by 4 yields a remainder of 2 or 3, 

(3) dividing n by 5 yields a remainder of 2 or 3. 

(4) dividing n by 7 yields a remainder of 3, 5, or 6. 

Using the previous exercise, add to this list of conditions the cases of dividing n by 
m for m = 6,8,9, 10. 
Exercise 4.14. Apply the previous exercise to rule out as many integers between 
1 and 100 that you can from being perfect squares. 


For example we can rule out 2 since dividing 2 by 4 yields a remainder of 2. 
We can rule out 3 since dividing 3 by 4 yields a remainder of 3. We can rule out 
5 since dividing 5 by 3 yields a remainder of 2. 


Exercise 4.15. Prove by induction on n that for every positive integer n and 
integers x and y, we have that x” — y” is divisible by x — y. 
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4.2. There are an infinite number of primes 


Recall the marvelous Theorem |L.56) which was proved using the Division Theorem. 


Theorem 4.12 (Euclid’s Theorem). There are infinitely many primes. 


As with many fundamental results, there are multiple proofs of the infinitude 
of the number of primes. The next pair of exercises combine to present one such 
proof. 


Exercise 4.16. Suppose that there exists a strictly increasing sequence of integers 
at least 2, {an }°2,, such that any two distinct terms of the sequence are coprime. 
That is, for alln # k, gcd(an,a,) = 1. Prove that this implies that there are 
infinitely many primes. You may use the Prime Factorization Theorem, which is 
the part of the Fundamental Theorem of Arithmetic that says that any integer at 
least 2 can be written as a product of primes (including the case of being a prime). 


Exercise 4.17. Let 

(4.23) Fy := 2°41 

be the n-th \Fermat number, where n € Z2. For example, the Fermat numbers Fy, 
for0<n<6 are 

(4.24) 3,5, 17, 257, 65537, 4294967297, 18446744073709551617. 


Prove that the Fermat numbers are all coprime to each other; that is, gcd(Fy, Fr) = 
1 for alln #k. By the previous exercise, this implies that there are infinitely many 
primes. 


Hint: First prove the formula 
n—-1 
(4.25) [[®="-2 forn>1. 
i=0 
Exercise 4.18. Prove that for n > 2, the last digit of the n-th Fermat number F, 
is 7. 
Remark 4.13. The of prime numbers a,, a € Zt, is 
defined as follows: 
a, ‘= 2, 
Qn ‘= least prime factor of aja2---an_1 +1 for n> 2. 
So, for example, 
a2 = 3, 
a3 =7 since 2-3+1=7 is prime, 
a4 = 43 since 2-3-7+1= 48 is prime, 
ds = 13 since 13 is the smallest prime factor of 2-3-7-43+1 = 1807. 


As of September 2012, only the first 51 terms of the Euclid—Mullin sequence are 
known. For example, 


d2g = 643679794963466223081509857. 


We leave it to you, dear reader, to prove that the terms of the Euclid—Mullin 
sequence are distinct; that is, aj A a; for i # j. 
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4.2.1. Euclid’s bound for the n-th prime number. We can obtain a crude 
(a.k.a. coarse) bound for the size of the n-th prime using an idea related to the 
proof that there are infinitely many primes. 

For n € Z*, let p, denote the n-th smallest prime number. That is, the set 
of primes less than p, is a finite set with cardinality n — 1. For example, p, = 2, 
p2 = 3, p3 = 5, etc. 


Lemma 4.14. We have 


(4.26) Pnti S 1+ [pi =1+pip2-+-DPn- 
i=l 


Proof. Since none of pj), p2,...,Pn divide 1+ Diy pi, by the Prime Factorization 
Theorem there must exist a prime pz, where k > n+ 1, that divides 1 + DE pi- 
We conclude that 


n 


(4.27) Pntt <Pr<1t+ [[.. 
= 


Exercise 4.19. What is the smallest positive integer n for which 22") > 1 |googol? 


Using the lemma, we can prove the following (ridiculously large by Exercise 
4.19|©) upper bound for the n-th prime number. 


Theorem 4.15 (Euclid). For alln € Z*, the n-th prime satisfies the bound 
(4.28) ee gee ale 


Proof. We prove this result by strong induction. 


Base case. We have 
(4.29) pi =2<27 =4. 
So the base case is true. 


Strong inductive step. Suppose that k € Zt is such that 
(4.30) pi<2? forl<i<k. 
Then, using Lemma/[4.14] we estimate as follows: 


k 
Prot < 1+] [pv 
i=1 


k 
< 1+]]2” 
i=1 


=1+4+2 whi 2 
=14o"=2) 
ei 


By strong mathematical induction, we are done. 
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We leave it to the reader to read about (the more fancy) of Euclid’s 
Theorem that there are infinitely many prime numbers. Can you think about how 


to make Euler’s proof rigorous? 


4.3. The Euclidean algorithm 


Division and the greatest common divisor go hand in hand. The Euclidean algo- 
rithm simplifies step by step the calculation of the greatest common divisor. 


4.3.1. Basic gcd properties. The key to the Euclidean algorithm is the fol- 
lowing observation. 


Theorem 4.16. For non-zero integers a and b suppose that 
a=bq+r where q,reé€Z. 
Then 
(4.31) gcd (a, b) = gcd (8,1). 
This is particularly useful whena >b>r > 0. For in this case, among the three 
integers, effectively the largest integer is replaced by the smallest integer. 
Proof. Suppose that d is a common divisor of a and b. Then, by Theorem we 
have that d divides the following integral linear combination of a and b: 
a:1+b-(-—q)=a-—bq=r, 
which is the remainder. Hence d is a common divisor of 6 and r. This implies that 
gcd(a, b) < gced(b,r). 
Conversely, if d is a common divisor of b and r, then d divides the following 
integral linear combination of b and r: 


b-qtr-l=bq+r=a. 
Hence d is a common divisor of b and a. This implies that gcd(b,r) < ged(a, b). 
We conclude that gcd(a, b) = gcd(b,r). 


Given non-zero integers a > 6, if we use the Division Theorem to divide a by 
b to get a remainder r with 0 < r < b, then Theorem says that the greatest 
common divisor of a and b remains unchanged if we replace a by the remainder r. 
This makes the problem of finding the gcd easier since r < b < a. 


Lemma 4.17. Suppose that a € Z and b € Z* are such that b divides a. Then 
gcd (a, b) = b. 


Proof. Since b divides itself and since b divides a by hypothesis, b is a common 

divisor of a and b. So b < gced(a,b). On the other hand, by Lemma [Z.8] and since 

b > 0, we have gcd(a,b) < b. We conclude that gcd (a, b) = b. 

Example 4.18. (1) We have that 19 divides 114. Thus gcd(114, 19) = 19. 
(2) We have 114 = 36-3+6. Therefore gcd(114, 36) = gcd(36, 6) = 6. 


4.3.2. The Euclidean algorithm. Now we can proceed to demonstrate the 


Euclidean algorithm 
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4.3.2.1. An example of the Euclidean algorithm. Consider the problem of finding 
the gcd of 936 and 324. By applying the Division Theorem repeatedly, we calculate 
that 


(4.32a) 936 = 324-2 + 288, 
(4.32b) 324 = 288-1436, 
(4.32c) 288 = 36-8 +0. 


By Theorem [4.16] we conclude that 
(4.33) gcd(936, 324) = ged(324, 288) = ged(288, 36) = 36, 
where the last equality is because 36 divides 288. 
612 


oe ++ 4++_+-4 ° 
0 36 288 324 936 


Figure 4.3.1. How is this diagram a visualization of the Euclidean algorithm? 
Note that counting backwards from 936 by 324’s yields 612 and 288. Counting 
backwards from 324 by 288 yields 36. 


Exercise 4.20. Use the Euclidean algorithm to compute the greatest common di- 
visor of 812 and 188. 


Exercise 4.21. Use the Euclidean algorithm to compute the greatest common di- 
visor of 744 and 258. 


4.3.2.2. How the Euclidean algorithm is proved. Now let us consider the Eu- 
clidean algorithm for a general pair of positive integers a > b. From the example 
above, we see that we successively find remainders of dividing a larger positive num- 
ber by a smaller positive number. After each step, the larger of the two integers is 
replaced by the remainder, which is smaller than the smaller of the two integers. 
When does this all stop? We claim that eventually we get a remainder of zero! 


Since the Euclidean algorithm produces a sequence of integers, we are going to 
put subscripts on the letters denoting the integers. Call, for the fun of it (okay, 
maybe this is not so fun, but you will see the pattern!), a =: r_1, b =: ro, and their 
remainder r =: r;. Soa = b¢g+r,0<r <b, becomes 


(4.34) T1=Trogd +11, 0<m< FOs 


where we also called q =: q,. If; = 0, then we stop and celebrate that gcd(a, b) = b. 
Otherwise, assuming 7; > 0, the next application of the Division Theorem yields 
the existence of integers q2 and r2 satisfying 


(4.35) ro=Mgtre, OS Te <7. 
If r2 = 0, we celebrate, congratulate ourselves, and observe that 
(4.36) gcd(a, b) = ged(ro,ri) = 171. 


It is rather clear that if we continue in this way, we obtain a sequence of non-negative 
integers 


(4.37) @=7Tr_1~>b=rm>r=mM1>7T2>73>°::, 
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where we stop when r; = 0 for some k. For such a k, we have rz_2 = rp—1 qx and 
hence 


(4.38) gcd(a, b) = ged(ro, 71) = ged(r1, 12) = +++ = ged(rp_2, 7-1) = 7p-1 > 0. 


This is the Euclidean algorithm for finding the gcd of a and b. 


Now the 20 million dollar question (inflation!) is: How do we prove that there 
exists a positive integer / such that r, = 0? That is, we actually do stop. 


To see that we must stop, we think intuitively: A priori (with some observa- 
tion we can do better in terms of the number of steps), the worst case scenario 
(the slowest sequence arriving at zero) is when the sequence of remainders is the 
arithmetic sequence 
(4.39) 

a=r_1y~>a-l=r>a-2=7 >a-3=72>°::->1=%-2>0=7e-1, 


so we must have the existence of such a k with k < a—1. Now, as long as r; is 
not 0, the algorithm produces a strictly smaller, but still positive, r;,,. So, a list 
of such r; has to reach 0 within a steps by the argument above. 


Summary. For the gcd of two distinct positive integers, we can divide the 
larger integer by the smaller integer to get a remainder. The gcd is equal to the gcd 
of the smaller integer and the remainder. In this way we can step by step reduce 
the size of the integers. Eventually we get a zero remainder. In this last case, the 
smaller number divides the larger number, and hence the gcd equals the smaller 
integer. 


4.3.3. Reversing the Euclidean algorithm. An additional benefit of the Eu- 
clidean algorithm is that by reversing it we can write the gcd of two integers a and 
b as an integral linear combination of a and b. 


4.3.3.1. Example of the gcd as an integral linear combination. Let us see how 
this works by considering the example in the previous subsection. 


By (432b), we have 
(4.40) 36 = 324 — 288. 
By (432a), we have 
(4.41) 288 = 936 — 324-2. 
Substituting this into the previous equation yields 


(4.42) 36 = 324 — 288 
= 324 — (936 — 324-2) 
= (—1) - 936 + 3-324. 


That is, gcd(936, 324) = 36, and 


(4.43) (—1)936 + (3)324 = 36. 
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4.3.3.2. The gcd is always an integral linear combination. In general, given two 
positive integers a > ), reversing the Euclidean algorithm finds us two integers m 


and n satisfying the linear Diophantine equation 
(4.44) ma+nb= gq, 


where g is the gcd of a and ). In our example, where a = 936 and b = 324, we found 
the solution given by the pair m = —1 and n = 3. 


Exercise 4.22. Write gcd(812, 188) as an integral linear combination of 812 and 
188. 


Exercise 4.23. Write gcd(744, 258) as an integral linear combination of 744 and 
258. 


Recall from Definition [51] that integers a and b, not both zero, are coprime 
if ged (a,b) = 1. Breaking down what we have stated above, we have: 


Theorem 4.19 (The gcd is an integral linear combination). Let a and b be integers, 
not both zero. Then there exist integers m and n satisfying 


(4.45) ma + nb = gcd(a, b). 
In particular, if a and b are coprime, then there exist integers m and n satisfying 


(4.46) ma+nb= 1. 


Proof. We use the notation r9 = a and r; = b as above. Let rzy_, = g be the last 
non-zero remainder as above, where g is the greatest common divisor of a and b. 
By reversing the Euclidean algorithm step by step, we firstly have that r,_, is an 
integral linear combination (i.lc.) of r,-3 and rg_2. Secondly, rg—2 is an i.Lc. of 
rp—_a and rp_3. Thus, rg_; is ani.lc. of rz_4 and rz_3. Continuing in this way, we 
finally see that r,_1 = g is an il.c. of ro = a and r; = 6 (a rigorous proof of this 
uses induction). 


Exercise 4.24. By reversing the Euclidean algorithm, find integers m and n sat- 
isfying 


(4.47) 83m + 17n = 1. 


4.4. Consequences of the Division Theorem 


As one might expect, the Division Theorem is fundamentally important in the 
study of (positive) integers from the point of view of multiplication. In particular, 
it is important in the study of prime numbers, the greatest common divisor, the 
property of two integers being coprime, divisors, etc. In this section we discuss 
some of the further consequences of the Division Theorem. 
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n 


10 13 16 19 22 25 28 31 34 37 40 


Figure 4.3.2. At each point (m,n) the integral linear combination 3m+5n is 
displayed right below it. For example, —19 is displayed right below the point 
(—3,—2). Since 3 and 5 are coprime, every integer may be represented as an 
integral linear combination. 


4.4.1. Characterization of two integers being coprime. One characterization 
of a and b being coprime is that there does not exist a prime that divides both 
a and 6. Here is another important characterization, which is mainly because of 
Theorem |4.19}on being able to express the gcd as an integral linear combination. 


Theorem 4.20 (Characterizing coprime integers). Non-zero integers a and b are 
coprime <> the integer 1 can be written as an integral linear combination of a and 


b. 
Proof. (=) This follows immediately from Theorem [4.19 
(<=) Suppose there exist m,n € Z such that 
mat+nb=1. 


Let c be any positive common divisor of a and b. Since c divides a and b, we 
have c divides ma+ nb = 1. This and c > 0 imply c = 1. We conclude that 
gcd (a,b) = 1. 


Summary. In other words, if c = g is the gcd of a and b, then there exist 
integers m,n such that (m)a+ (n)b = c. Conversely if there exist integers m,n 
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such that (m)a + (n)b = c, where c > 0, then since g divides c, we have g < c. In 
particular, if c= 1, then g = 1. 


Exercise 4.25. Let p and q be prime numbers. Prove that p and q are not equal if 
and only if there exist integers m,n such that 

(4.48) mp+ngq=1. 

Exercise 4.26. Prove that for every integer n, 

(4.49) gcd(5n + 2,12n4+5) =1. 

Hint: Use Theorem |4.20 


Exercise 4.27. Let us generalize the previous exercise. Let a,b,c,d be integers 
satisfying ad — be =1. Prove that for every integer n, 


(4.50) gcd(an + b,cn + d) = 1. 
Is this really a generalization of the previous exercise? Explain. 


Solved Problem 4.21 (Integers are coprime if and only if they have no prime 
common divisors). Prove that non-zero integers a and b satisfy gcd(a, b) = 1 if and 
only if there does not exist a prime p that divides both a and b. 


Solution. (=) Considering the contrapositive, suppose that there exists a 
prime p that divides both a and b. Then gcd(a,b) > p > 1, so that gced(a, b) £1. 

(<) Considering the contrapositive (again!), suppose that g := gcd(a,b) # 1. 
Then g > 1. By Theorem 2.15] there exists a prime p dividing g. Since g divides 
both a and b, by the transitivity of division this implies that p divides both a and 
b. 


Solved Problem 4.22. Suppose that non-zero integers a,b,c,d satisfy 
(4.51) gcd(a, b) = gcd(a, d) = gcd(c, b) = gcd(c, d) = 1. 


(We can think of this hypothesis as saying that the two integers in ac have nothing 
in common multiplicatively with the two integers in bd.) Prove that 


(4.52) gcd(ac, bd) = 1. 
Solution. Suppose for a contradiction that gcd(ac,bd) 4 1. Then, by the 


previous problem, there exists a prime p that divides both ac and bd. Thus, by 
Lemma |4.26 


(p divides a or p divides c) and _ (p divides b or p divides d) . 


By considering the four resulting cases, we see that this contradicts (4.51). 


Here is an alternative proof. By hypothesis, a and b are coprime, and a 
and d are coprime. Hence, by Lemma [4.29] we have that a and bd are coprime. 
Similarly, we see that c and bd are coprime. 


Now, since bd and a are coprime and since bd and c are coprime, by Lemma 
[4.29] we have that bd and ac are coprime. 


4.4. Consequences of the Division Theorem 179 


4.4.2. Division of products and coprimeness. Consider the situation of an 
integer dividing the product of two integers. The following gives us a criterion for 
when the integer will divide one of the two factors. 


Theorem 4.23. Suppose a,b,c € Z* anda and b are coprime. If a divides be, 
then a divides c. 


Note that the converse of this implication is true by Exercise [1.23] 


Proof. The idea is to use that a and b are coprime. By Theorem [4.20] there exist 
m,n € Z such that 
1=am+bn. 
The key is to link this to c. The idea is simply to multiply this equation by c to get 
c=cam+cbn. 
Now a clearly divides cam since a: cm = cam. On the other hand, since a divides 


bc by hypothesis, we have that a divides cbn since cbn = bc-n. Hence a divides the 
sum cam + cbn. Since cam + cbn = c, we conclude that a divides c. 


This is an elegant and important proof. 


Interpretation. Intuitively the statement says that if a “goes completely into 
the product” bc while a has “nothing to do with” b, then a must “go completely 
into” c. 

Taking advantage of the fact that primes have either all or nothing in common 
multiplicatively with other integers, we can deduce the following. 


Corollary 4.24. If a prime p divides the square a? of an integer, then p divides a. 


Proof. Since p is a prime, we have gcd(a, p) is equal to 1 or p. 
Case 1: gcd(a,p) = p. This implies that p divides a, and we are done. 


Case 2: gcd(a,p) = 1. We can apply Theorem [4.23] to the hypothesis that p 
divides a-a to conclude that p divides a, where this a is the second a in a-a (and 
we used gcd(a,p) = 1 for the first a in a- a). 


In either case, we have p divides a. 


Remark 4.25. Here is another argument to prove Corollary [4.24] Can you fill 
in the details? We can write a as a product of primes: a = pip2---p,, where the 
primes p; might not be distinct. Then a? = p7p3---p?. Since p divides a?, we must 
have that p = p; for some 7. Therefore p divides a. 


We have the following generalization of Corollary [4.24] 
Lemma 4.26. If a prime p divides a product ab, then p divides a or p divides b. 
Exercise 4.28. Prove Lemma [4.26] 


Solved Problem 4.27. For each part of this problem, if you use a result about 
division, state the result. 


(1) Suppose 12 divides 15k, where k is an integer. Prove that 4 divides k. 
(2) Suppose that gcd(9, ab) > 1. Prove that (8 divides a) or (3 divides b). 
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Solution. (1) Division fact (Theorem[4.23): If a divides bc and if gcd(a, b) = 1, 
then a divides c. Since 12 divides 15k, we have that 4 divides 5k. Since gcd(4,5) = 1 
and by the aforestated division fact, we conclude that 4 divides k. 


(2) Division fact (Lemma [4.26): If a prime p divides a product of integers 
ab, then (p divides a) or (p divides b). We apply this fact with p = 3. Since 
g := gcd(9, ab) > 1 and since g divides 9, we have that g equals 3 or 9. In particular, 
3 divides g. Since g divides ab, by the transitivity of division this implies that 3 
divides ab. Finally, by the aforestated division fact, we conclude that (3 divides a) 
or (3 divides b). 


Solved Problem 4.28 (If a prime divides a positive power of an integer, then 
it divides the integer). If a prime p divides a”, where a is an integer and n is a 
positive integer, then p divides a. 


Solution. We prove this by induction on n. To be more formal this time, we 
explicitly mention that the statement P(n) that we want to prove for all n > 1 is: 


For any prime p and integer a, if p divides a”, then p divides a. 
Clearly the base case P(1) is true. 


As the inductive hypothesis, suppose that k is a positive integer such that P(k) 
is true; that is, for every prime p and integer a, if p divides a*, then p divides a. 

Now let p be a prime and let a be a positive integer such that p divides a*t?. 
Since we may think of this as saying that p divides the product a*a, since p is 
a prime and by Lemma [4.26] we have that p divides a® or p divides a. By the 
inductive hypothesis, we have that p divides a* implies that p divides a. Thus, we 
conclude that p divides a. This proves P(k + 1) and hence proves the inductive 


step. 


By induction, we have proved P(n) for all n > 1. 


The following says that if an integer has nothing in common multiplicatively 
with each of two integers, then that integer has nothing in common multiplicatively 
with the product of those two integers. 


Lemma 4.29. Ifa and b are coprime and if a and c are coprime, then a and bc 
are coprime. 


Proof. Suppose gcd(a,b) = 1 and ged(a,c) = 1. Let d be a positive common 
divisor of a and bc. Then, by Exercise [1.31] we have 


(4.53) gcd(d, b) < gced(a, b) = 1, 
which implies that gcd(d,b) = 1. Now, by Theorem [4.23] since d divides bc and 
since gcd(d, b) = 1, we have that d divides c. This and using that d > 0 divides a 


implies that d = 1 since a and c are coprime. Finally, d = 1 implies that a and bc 
are coprime. 


Remark 4.30. An alternative, elegant proof of Lemma [4.29] uses the characteri- 
zation of Theorem for being coprime. Namely, by the hypotheses, there exist 
integers x,y, Z,w such that 


1= az + by, l=az+cw. 
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By multiplying(!) these two equations together, we obtain that 


1 = (ax + by)(az + cw) 


= aaxz + acxw + abyz + beyw 


= a(axz + caw + byz) + bc(yw). 
By Theorem [4.20] this implies that a and bc are coprime. 


Remark 4.31. Here is yet another argument to prove Lemma [4.29] Can you fill 
in the details? Assuming the hypothesis, suppose for a contradiction that a and bc 
are not coprime. Then there exists a prime p that divides both a and bc. Since p 
divides bc, we have that p divides b or that p divides c. 


Exercise 4.29. Let a,b,c be non-zero integers. Prove that if a and be are not 
coprime, then a and b are not coprime or a and c are not coprime. 


4.4.3. Division of products and the gcd. On the other hand, suppose we have 
the same situation as above, except that a has “multiplicatively something to do 
with” b, such as a= 6 and b= 15. That is, suppose that a and 6 are not coprime. 
Then we can take for example c = 14 (an even number) and we get that 


a = 6 divides b-c = 15-14 = 210 


but a = 6 does not divide c = 14. What happens is that part of 6 (namely 3) goes 
into b = 15 whereas the other part of 6 (namely 2) goes into c = 14. More generally, 
we have the following result. 


Theorem 4.32. With Z as the universal set, suppose that a non-zero integer c 
divides a product ab. Let g = gcd(c,a). Then 


(4.54) «divides b. 
g 
The converse of this implication is also true. 


Observe that Theorem [4.23] is the special case where g = 1. 


Proof. By the hypothesis, it follows that the integer : divides the product of 
2) = 1 by Theorem[I.50] we conclude by Theorem[4.23] 


integers : -b. Since gcd (<. 5 
that . divides b. 
To see the converse, suppose that > divides b. Then c divides gb. Since g 


divides a, this implies that c divides ab. 


Exercise 4.30. Prove that for everyn € Z*, 
(4.55) gcd(a”, b”) = ged(a, b)”. 


Hint: For showing that the assumption that gcd(a,b)” < gcd(a”,b") leads to a 
contradiction, use Solved Problem [4.28] 
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4.4.4. Coprime integers dividing an integer. Suppose that 5 and 6 both divide 
an integer x. Then we know that 30 divides x. The reason is the following result. 


Proposition 4.33. Suppose that m, and mz are coprime positive integers. If m4 


and mz both divide an integer x, then their product m mz divides z. 


Proof. Since m, divides x, we have that x = mj ,y for some integer y. Now, since 
mg divides the product my y and gcd(m1, m2), we have by Theorem that me 
divides y. Therefore there exists an integer z such that y = m2z. We conclude that 


= MY = ™MIINQZ, 


so that m mz divides x. 


On the other hand, if 6 and 10 both divide an integer x, then 30 divides x. The 
reason is now the following. 


Exercise 4.31. Generalize Proposition [4.33] by proving that if positive integers my 
and mz both divide an integer x, then mymg/g divides x, where g = gcd(my,, mz). 


We remark that m,m2/g is the least common multiple of m, and m2; see 
below. 


By induction, we can extend the proposition to the following. 


Corollary 4.34. Suppose that m1,m2,...,Mz are pairwise coprime positive inte- 
gers. If each of m,,mg2,...,M, divides an integer x, then their product mym2-+-Mz 
divides x. 


Exercise 4.32. Prove Corollary by induction. 
Example 4.35. If 7,11,13 each divide an integer x, then 1001 divides x. 


4.5. Solving linear Diophantine equations 


Figure 4.5.1. (~207-~291). 


In general, given integers a, ), and c, we want to find all integer solutions m 
and n to the equation 


(4.56) am + bn = c. 
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We call this a linear Diophantine equation, This is the simplest non-trivial 


type of equation over the integers. 


For example, consider the equation 


(4.57) 4199m + 1748n = 95, 

where we would like to find all integer solutions m and n to this equation. We have 
(4.58) gcd (4199, 1748) = 19, 

and by reversing the Euclidean algorithm, one sees that a solution to 

(4.59) 4199m + 1748n = 19 

is given by m = 5 and n = —12. Multiplying all of this by 5, we find that a 
particular solution to (4.57) is given by 

(4.60) m=25 and n=-—60. 


4.5.1. Necessary and sufficient condition for the existence of a solution to 
a linear Diophantine equation. In general, the consequence above of reversing 
the Euclidean algorithm gives us the following existence result for solving linear 
Diophantine equations. 


Theorem 4.36. Let a,b,c € Zt. Then there exist m,n € Z such that 
(4.61) am + bn =c 
if and only if gcd (a,b) divides c. 
That is, given positive integers a,b,c, we can solve the equation am + bn = c 
for some pair of integers m,n if and only if c is divisible by gcd (a, b). 
For example, we can solve the equation 
4199m + 1748n = c 


for integers m and n if and only if c is divisible by 19. This is why we were able 
to find a solution when c = 95. On the other hand if 77 < c < 94 (76 is the next 
multiple of 19), then the equation 4199m + 1748n = c has no integer solutions m 
and n. 


Proof of Theorem [4.36] (<) Suppose ged (a,b) divides c. By Theorem [4.19] 
there exist m,n € Z such that 


gcd (a, b) = am + bin. 
Since gcd (a,b) divides c, there exists gq € Z such that 
c = gcd (a,b) - gq. 
Thus 
c= gcd (a,b) -¢q 
= (am + bn) q 
=a(mq) + (nq). 


Taking m = mq and n = fig yields am + bn = c. 
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(=) Suppose there exist m,n € Z such that am + bn = c. Since ged (a,b) 
divides both m and n, it divides the integral linear combination am + bn. Now, 
since am + bn = c, we conclude that gcd (a, b) divides c. 


So we know when the equation am + bn = c has at least one solution. Now, 
when it has a solution, how do we describe all solutions? 


Example 4.37. We have that mo = 1 and no = 2 solve the linear Diophantine 
equation 60 = 24mp + 1879; that is, 
60 = 24-14 18-2. 
We can now describe all solutions. Suppose that m and n also solve 
60 = 24m + 18n. 
Then, by taking the difference of the two displayed equations above, we get 
0 = 24m + 18n — (24-14 18-2) 
= 24(m—1)4+18(n—-2). 


This equation is equivalent to the one obtained by dividing by gcd (24,18) = 6; 
namely 

—3(n— 2) =4(m—1). 
Since gcd (3,4) = 1 and since 3 divides 4(m— 1), by Theorem we conclude 
that 3 divides m— 1. Hence there exists an integer k such that 


m—1= 3k. 
By substituting this display into the previous display, we obtain 
—3(n — 2) =4(m—1) =4-3k, 
so that (by cancelling the 3 factors) 
n—2=-—Ak., 
We have shown that if m,n is any solution, then we must have that 
m=1+4 3k, 
n = 2-— Ak, 
where k is an integer. 


Now we have to check that m and n of the form above are actually solutions. 
That is, conversely, m = 1+ 8k and n = 2 — 4k is a solution for every integer k. 
Indeed, we check that for all integers k: 


24(1 + 3k) + 18(2 — 4k) = 24 + 72k + 36 — 72k = 60. 


Summarizing, we have shown that m and n are integer solutions to 60 = 24m+ 
18n if and only if 


m=1+4 3k, 
n = 2—A4k, 


where & is an integer. 
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Exercise 4.33. (1) Find a particular solution to the linear Diophantine equation 
812m + 188n = 28. 


(2) Find a particular solution to the linear Diophantine equation 744m + 258n = 
18. 


4.5.2. How to solve the general linear Diophantine equation. By following 
the method in Example[4.37] we can now describe how to solve any linear Diophan- 
tine equation. Given positive integers a, b, and c, consider the linear Diophantine 
equation 

am+bn=c. 
Let g = gcd(a,b). We assume that g divides c; otherwise, there are no solutions. 
By Theorem [4.19] there exist m,n € Z such that 
g=am+bn. 
Since gcd (a, b) divides c, there exists q € Z such that c = gq. Thus 
c=a(mq) + b(fq). 
Taking mp = mq and np = nq yields a “particular solution”: 
amg +bng =c. 
Now we find the general solution m and n. Suppose that am + bn = c. Then 
subtracting the display above from this, we obtain the “homogeneous equation” 
a(m— mo) + b(n — 19) = 0. 
We divide this equation by the gcd g to get 
a 
rae) = ——(n—1N9). 


Since gcd (s. 2) = 1 and since a divides = (n—ng), conclude from Theorem [4.23] 
that 3 divides n — ng. So there exists an integer k such that 


a 
n—1no = —k. 


Plugging this display into the previous display, we obtain 


a ba 
—(m—mo) =—--—k, 
g gg 
so that 
b 
m—-mj=——k 
g 


We conclude that: 


Theorem 4.38. [fm = mo and n = no is a particular solution to the linear 
Diophantine equation am + bn =c, then the general solution is given by 


n=n + —k, 


b 


m=mo——k, 


where k is an integer. 
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Conversely, if m and n are given as above, then we calculate that 


b 
am+bn=a(mo— 2k) +(no+ 2h) 
g g 


ee me a ae 
g g 


=amo +bno 


= Cc. 


Therefore, the general solution to the linear Diophantine equation am + bn = c is 
given by the display above the previous display. 


b a 
(mo — gro + a) 


a/g 


am+bn=c 


aa 


Figure 4.5.2. Visualizing the general solution to a linear Diophantine equa- 
tion. Starting with a particular solution (mo,no) (the red dot), the general 


solution is obtained by adding integer multiples of (-2, 2) to get the blue 


dots. Note that the slope of the line is equal to —a/b. 


Solved Problem 4.39. Find all solutions to the linear Diophantine equation 


(4.62) 140m + 63n = 35. 


Solution. We first find gcd(140, 63) by the Euclidean algorithm: 


140 = 63-2414, 
63 = 14-447, 
14=7-2+4+0. 


Thus gced(140,63) = 7. Since 7 divides 35, we know that the linear Diophantine 
equation has a solution. We now proceed to find all solutions. 
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By reversing the Euclidean algorithm, we have 
7=63+4 14. (—4) 
= 63 + (140 + 63 - (—2)) - (—4) 
= 140- (—4) + 63 - 9. 
Thus a particular solution to our linear Diophantine equation is given by mp = —20 
and no = 45. Since 7 = 20 and S = —9, the general solution to (4.62) is given by 


m=—20-9k, n=45+ 20k, 


where k € Z. 


Exercise 4.34. (1) Find the general solution to the linear Diophantine equation 
812m + 188n = 28. 


(2) Find the general solution to the linear Diophantine equation 744m+258n = 18. 


4.6. “Practical” applications of solving linear Diophantine equations 
(wink ©) 


In the movie |Die Hard with a Vengeance, Simon asks John McClane and Zeus 


Carver to solve the following linear Diophantine equation in the guise of a|water! 
pouring puzzle 


All you have is a 3-gallon jug, a 5-gallon jug, and an unlimited 
supply of water. Engineer it so that you have exactly 4 gallons of 
water in the 5-gallon jug. See Figure |4.6.1 


Figure 4.6.1. A 3-gallon jug and a 5-gallon jug. We wish to fill the 5-gallon 
jug with 4 gallons of water. 


Here is a solution: 


Fill the 5-gallon jug with water. Pour water from the 5-gallon jug into the 3-gallon 
jug until it is full. This leaves 2 gallons of water in the 5-gallon jug. Empty the 
3-gallon jug. Pour the 2 gallons of water from the 5-gallon jug into the 3-gallon 
jug. Now fill the 5-gallon jug with water again. Pour water from the 5-gallon jug 
into the 3-gallon jug until it is full. Since the 3-gallon jug had exactly 2 gallons of 
water, exactly 1 gallon of water was poured from the 5-gallon jug, so that exactly 
4 gallons of water remains in the 5-gallon jug. Sigh of relief! 


Now suppose that the supply of water is unlimited, but it costs $10/gallon. 
How do you minimize the number of gallons you need to buy? Hint: The solution 
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above uses 10 gallons, which costs $100! Can you think of a (not much) cheaper 
way to do this by first filling the 3-gallon jug with water? 


By working backward, here is a solution to a linear Diophantine equation cor- 
responding to the first solution above: 


4=5-1 
=5- (3-2) 
=5-(3— (5—8)) 
=2-5+(-2)-3. 


4.7*. (Polynomial) Diophantine equations 


Generalizing linear Diophantine equations, a|/Diophantine equation is a polynomial 


equation where we are looking only for integer solutions. 


For example, recall from (3.80) that a Pythagorean triple is a positive integer 
solution (2, y, z) to the Diophantine equation 


(4.63) way =2". 


A famous example of a Diophantine equation is the equation of Fermat’s Last 
‘Theorem: 


(4.64) eo” ty" = 2", 


where n > 3, x, y, and z are positive integers. 


Figure 4.7.1. |Marie-Sophie Germain) (1776-1831). Wikimedia Commons, 


Public Domain. 


Over the span of a few hundred years, many mathematicians contributed to 
the solution of Fermat’s Last Theorem, including, just to name a few: Fermat, 
Germain, Klein, Fricke, Hurwitz, Hecke, Dirichlet, Dedekind, Langlands, Tunnell, 
Deligne, Rapoport, Katz, Mazur, Igusa, Eichler, Shimura, Taniyama, Frey, Bloch, 
Kato, and Selmer. 
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The conjecture of Fermat was finally solved by [Andrew Wiles and 
Namely, they proved: 


Theorem 4.40 (Fermat’s Last Theorem). There do not exist positive integers 
n> 3,2, y, and z satisfying the equation 


(4.65) fy” = 2". 
This is a beautiful and simple statement, but it was remarkably difficult to 
prove. 
Exercise 4.35. Using a calculator or computer, calculate 
(4.66) (39871? + 436512)1/12, 


Is your answer an integer? 


Without relying on Fermat’s Last Theorem (unless you know its proof!), prove 
that 


(4.67) 39871? + 4365/2 4 4472)”. 
Hint: Consider the remainders after division by 4. 


One can look at special cases of Fermat’s Last Theorem and hope that they 
are easier to prove. For example, Fermat himself proved the following. 


Theorem 4.41. There do not exist positive integers x, y, and z satisfying the 
equation 


(4.68) ai ty* = 24, 


a? 


Figure 4.7.2. Theorem [|4.41]says that there cannot be a right triangle with 
leg lengths x? and y? and hypotenuse length z?, where x,y,z are positive 
integers. If such a triangle existed, the Pythagorean theorem would say that 
(x?)? + (y?)? = (z?)?. Replacing 2? by w corresponds to Proposition [4.42 


This result is a corollary of the following result (for simply take w = 2? in the 
display below). 


Proposition 4.42. There do not exist positive integers x, y, and w satisfying the 
equation 


(4.69) at +y* =w’. 
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Proof. Let S be the set of all Pythagorean triples where each of the legs is a perfect 
square: 


S:= 1a a ao) le” +y*= wm”). 
Let T C Zt be the set of all hypotenuses of Pythagorean triples in the set S: 
T = {w|(2’,y’,w) € St. 


Suppose for a contradiction that the proposition is false. Then S is non-empty, and 
hence T is a non-empty subset of Z*. By the well-ordering principle (Axiom [4.4), 
T has a minimum element w. By the definition of T,, there exist positive integers 
x and y such that (x?, y?, w) is a Pythagorean triple. 


We have that x? or y? is odd, since otherwise w could not be the minimum 
element of T. Thus, without loss of generality, we may assume that 2? is odd; 
that is, x is odd. By Lemma [3.56] this implies that y is even. Furthermore, by 
minimality of w in T, it is easy to show that the Pythagorean triple (x?, y?, w) is 
primitive. So, by Lemma [3.59] we have that there exist positive integers m > n 
such that (Euclid’s formula) 


(4.70) g=m—n?, y2=2mn, w=m?4+n?. 


In particular, x? +n? = m?, so that (x,n,m) is a Pythagorean triple. Now we may 
apply Lemma|3.59}again to obtain that there exist positive integers g > r such that 


(4.71) a=q—r’, n=2qr, m=q' +r’, 


where g and r are coprime and one is even and one is odd (since x is odd). Fur- 
thermore, by m = q? + r?, we have that m and q are coprime and m and r are 
coprime. 


By and (471), we have y? = 2mn = 4grm, so that grm = (4)? is a 
perfect square. Since the integers q,r,m are pairwise coprime, we conclude that 
each is a perfect square. That is, there exist integers a,b,c such that q = a?, 
r= 07, and m=’. So, by q? +r? =m, we have that (a”,b?,c) is a Pythagorean 
triple. Observe that the Pythagorean triple (a, b?,c) has hypotenuse less than the 
hypotenuse of the Pythagorean triple (x?, y?, w) since 


C<C=am<m <m+n2=w. 


This contradicts that w is the minimum element of T. 


4.8. The Fundamental Theorem of Arithmetic 


Recall that Theorem [2.15] says that any integer n > 2 is prime or is a product 
of primes. We upgrade this result by adding to it the uniqueness of the prime 
factorization: 


Theorem 4.43 (Fundamental Theorem of Arithmetic). Any integer n greater than 
1 can be uniquely written as the product of primes in non-decreasing order. 


That is, there exist primes py < po <--+ < pr such that 


(4.72) M = Pip2*** Pr. 
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Moreover, if 


(4.73) m= q192°"* Qe, 
where q, < q2 <-+:<qe are primes, then k = ¢ and p; =q for1<i<k. 


Proof. The Prime Factorization Theorem yields the existence part of the 
statement, so we are just left to prove the uniqueness part. 


A way to simplify things is to cancel all|p’s and q’s| that are equal In view of 
this, one argues that if uniqueness fails, then one can assume that the p’s and q’s are 
distinct from each other; that is, for all 7,7, p; 4 q;. We then obtain a contradiction 
because taking, say, p1, since p; divides the product of the q’s (exercise: explain 
why), it must divide one of the q’s, which means that p; must equal one of the q’s. 


We expressed the proof in this way for conceptual clarity. A formal proof would 
take more effort. 


4.9. The least common multiple 


On the flip side of the greatest common divisor, we have the least common multiple. 
Definition 4.44. Let a and b be non-zero integers. The least common multiple 
é£ = |cmfa,}] of a and b is the unique positive integer ¢ satisfying the following 
properties: 


(1) €is a common multiple of a and b; that is, a divides @ and b divides ¢. 


(2) @ is less than or equal to any positive common multiple of a and 6; that is, if 
m is a positive common multiple of a and b, then @ < m. 


Let S be the set of positive integers that are common multiples of a and b. 
That is, 
S={m€Z* :a divides m and b divides m}. 


We have that |ab| € S, so that S is non-empty. By the well-ordering principle, we 
have that S has a least element. This element is the lem of a and b. Since the 
least element of S' is unique, the least common multiple @ = lcm[a, b] of a and b is 
unique. 


We now sketch a proof of the formula: For any positive integers a and b, 
(4.74) gcd (a, b) - lem [a, b] = ab. 
Can you fill in the details? 

Denote ¢ := lcm[{a, b] and g := gcd(a, b). 


Assertion As Small: 


b 
(4.75) > <9. 
Assertion As Big: 
b 
(4.76) > > 9. 


?For example, if piz7 = q23 = q24, then we choose one of q23, gz24 to cancel with pi7. 
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Why it’s as small: 


e For any common multiple m of a and 6, @ divides m, for if 2 does not divide 
m, then by the Division Theorem there exists q,r € Z such that 


m=q-:l+r, where0O<r<@. 


Then, r is a common multiple of a and b, and hence r > @, a contradiction. 


e ¢ divides ab. Thus, 2 is a positive integer. 


g 
° ap is a common divisor of a and b. E.g., explain why ab divides af is true and 
why that helps. 
Why it’s as big: 
= is acommon multiple of a and b. 


Explain why this is easy to prove and why this is all that we need! 


“| 
Figure 4.9.1. We divide m by ¢ = lcm|a, }] to obtain a remainder r. Since a 
divides both £ and m, we have that a divides r. Similarly, b divides r. Since 


r < £, we conclude that r = 0. That is, 2 divides m. This shows that the least 
common multiple divides any common multiple. 


4.9.1. The gcd and Icm via the prime decomposition. We can see the rela- 
tionship between the greatest common divisor and the least common multiple of two 
integers by consideration of the prime decompositions of the integers. Let N > 2 be 
an integer. By the Fundamental Theorem of Arithmetic, we may (uniquely) write 


(4.77) m= pips? + De", 
where pj < pg <-:+: <p, are prime and k,,ko,...,k, are positive integers. 

Now, let m and n be integers, both of which are at least 2. Then there exist 
primes qi < q2 <--- < qs and non-negative integers ky, ko,...,k; and 01, 2,..., 
such that 


(4.78) n = ght gk? ... qh and m= did? .-.q’. 
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Can you prove the following formulas for the greatest common divisor and the 
least common multiple? 


(4.79a) icel (te) ot fiiaiiaie ie i er mec 
(4.79b) lem[n, m] = pith ant haitas abd gqmextharts} 


Using this, can you explain again why 
gcd(n,m) -lem[n, m] = nm? 


You may need to prove a little lemma about max and min! 


4.10. Residues modulo an odd prime 


We are particularly interested in division by primes. In this section we consider 
elementary consequences of the Division Theorem in this setting. 
Residues are siblings of remainders. Let p be an odd prime, and define 
p-1 


b=, 


which is an integer. 
Observe that, corresponding to p = 7, we have that for any integer a there exist 
unique integers g and r such that 
a=T7q+r, —3<r<3. 
We call r the residue of a modulo 7. Generalizing the fact above, we have the 
following. 


Exercise 4.36. Prove that for every integer a, there exist unique integers q and r 
such that 

(4.80) a=pqtr, —b<r<b. 

Hint: Apply the proof of the Division Theorem. You only will need that p is odd (it 
is not necessary for p to be a prime). 


Note that if we use the Division Theorem verbatim, then we obtain a quotient 
qd and a remainder r’ with 0 <r’ < p—1= 2b. This exercise says that we can 
effectively shift the remainder interval from [0, 2b] to [—, b]. 


Let a be an integer which is not a multiple of p. Consider the first b positive 

multiples of a: 
la, 2a, 3a,..., ba. 
For 1 <i < b, let —b < r; < b be the unique remainder of ia given by (4.80), so 
that 
(4.81) p divides ia — rj. 
We call r; the residue of ia (modulo p). 
Exercise 4.37. (1) Explain why all the residues of ia are non-zero: r; # 0 for 
alll <i<ob. 


(2) Prove that ifr; <0 andr; > 0, then r; +1; is not divisible by p. Hint: Can 
(i+ j)a be a multiple of p? 
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Exercise 4.38. Prove that 


(4.82) TL Gr [Dn =2:- 


Here, the exclamation mark means factorial, so that it is b! instead of us getting 
really excited about b. © 


Example 4.45. Let p = 11, so that b = 5. Let a = 8. Then, the first 5 positive 
multiples of a are 8,16, 24,32,40. The corresponding residues are 

(4.83) —3, 5, 2, -1, —4. 

And, indeed, 3:5-2-1-4= 51. 


4.11. Appendix 


4.11.1. Equivalence of induction and the well-ordering principle. Firstly, we 
show that mathematical induction and the well-ordering principle are equivalent: 


Proof of Theorem [4.5} We want to prove: 
mathematical induction <  well-ordering principle. 


(=) Assume mathematical induction. Let A be a subset of Z* that has no least 
element. It suffices to prove that A is empty. 


Let P(n) be the statement that N, c Zt — A; that is, ANN, = 9. 


Base case: Suppose for a contradiction that 1 € A. Since 1 is the least element 
of Z* and since A is a subset of Z*, we conclude that 1 is the least element of A, 
a contradiction to our assumption on A. Therefore 1 ¢ A, which is equivalent to 
saying Ny c Zt — A= A° (taking Z* to be the universal set). 

Inductive step: Suppose that k is a positive integer such that N; C A‘°. Suppose 
for a contradiction that k +1 € A. Suppose m € Z* satisfies m <k+1. Then 
m € Nx, which implies that m ¢ A. This proves that k + 1 is the least element of 
A, a contradiction. Therefore k+1¢ A. This and N; Cc A‘ imply that Ny41 Cc A. 


Since we have assumed that mathematical induction is true, we can now con- 
clude that P(n) is true for all n € Zt. In particular, for all n € Zt we have 
n€N, Cc A‘, which says that n ¢ A. That is, no positive integer is in A. Since A 
is a subset of the set of positive integers, we conclude that A is empty. 


(<=) Assume that the well-ordering principle is true. We want to prove mathemat- 
ical induction. 
To this end, let A be a subset of Z* which satisfies the following properties: 
(1) LEA. 
(2) Forallke Zt, ifke A, thn k+1€ A. 
We need to show that A = Z*; that is, Ac = 0. 


Suppose for a contradiction that A‘ is non-empty. Since we have assumed that 
the well-ordering principle is true, we have that A° has a least element. Call this 
least element ¢. Since ¢ € A‘, we have ¢ ¢ A. 
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Since property (1) says that 1 € A, we have that £4 1. Since ¢ € Z", this tells 
us that @> 2. 

Suppose for a contradiction that 2—1 € A. Since 0 > 2, 2-1 € Z*. So property 
(2) and €—1 € A tell us that @ = (¢—1)+1 € A, which contradicts ¢ ¢ A. We 
conclude from this that ¢— 1 ¢ A; that is, 2-1 € A®. 

But since €—1 < @, this contradicts that @ is the least element of AC. From all 
of this we conclude that A‘ is empty, as desired. 


4.11.2. Finite sets have extrema. Secondly, we consider the minimum and 
maximum for finite sets of real numbers. The following result is proved by induction. 


Theorem 4.46. If X is a non-empty finite set of real numbers, then X has both a 
minimum element and a maximum element. 


Proof. We prove that X has a minimum by induction on the cardinality of X. 
The existence of a maximum is proved similarly. 

Base case: n = 1. In this case X consists of a single real number, which must 
be (both the maximum and) the minimum of the set. 

Inductive step. Suppose that k € Z* has the property that the theorem is 
true for all sets of real numbers of cardinality k. Let X be a set of real numbers 
with cardinality k + 1. Choose an element of X and call it xo. 

Case 1: x is the minimum element of X. Then we are done. 

Case 2: 2 is not the minimum element of X. Let Y = X — {ap}. Then Y is 
a finite set of real numbers with cardinality k. Hence we can apply the inductive 
hypothesis to obtain that Y has a minimum element yo. 

We claim that yo is the minimum element of X. Actually this is pretty easy to 
see: Let x € X. 

If x = xo, then since zo is not the minimum element of X, there exists 71 € X 
such that « = x29 > 21 > yo, where the last equality follows from x; € X—{xo} = Y 
and yo being the minimum element of Y. 

On the other hand, if « 4 zo, then « € X — {xo} = Y. Since yo is the minimum 
element of Y, we have that x > yo. 

In either case, we have proved that « > yo. We conclude that yo is the minimum 
element of X. This completes the inductive step. 


By induction we are done. 


4.11.3. An alternate proof of the well-ordering principle. Using Theorem 
[4.46] we can give an alternate proof of the well-ordering principle. 


Proof of Axiom Let X be a non-empty subset of Zt. Since X is non-empty, 
there exists an integer no in X. Let 
(4.84) Y:={xeEeX:1<aK<no}. 


Then Y is non-empty since no € Y, and Y is finite since Y C N,,,. Hence, by 
Theorem [4.46] Y has a minimum element, which we call m. 
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We will now prove that m is the minimum element of X. Let x € X. 


Case 1: x < no. Then x € Y. Since m is the minimum element of Y, we have 
ma. 


Case 2: x > 1p. Then m < no < 2, so that m < a. 


This proves that m is the minimum element of X. 


4.12. Hints and partial solutions for the exercises 


Hint for Exercise Let i := r(n). Consider two cases: (1) i = 0 and 
(2) 0 <i <3. In case (1) explain why n is the only integer divisible by 3. In case 
(2) explain why n + 3 —7 is the only integer divisible by 3. 

Hint for Exercise [4.2] By the well-ordering principle, X has a minimum element 
m. Using this, what should the minimum element of Y be? Prove it. 


Hint for Exercise Let f(n) =n? +n+41. We have 
(4.85) f(40) = 40? + 40 + 41 = 40-414. 41 = 412, 


which is not prime. 


Hint for Exercise Use the hint given. You may compare with the hint for 
Exercise [4.1] if you like. 


Hint for Exercise To prove existence, you can take the gq and r from the 
Division Theorem and subtract 3 from q, which adds 15 to r. 


To prove uniqueness, you may “transfer” the problem to the case of the Division 
Theorem, or alternatively you may mimic the proof of uniqueness in the Division 
Theorem. 


Hint for Exercise We have a = mq(a) +r(a) and similarly for b. 


Hint for Exercise Let n be an even integer. Then there exist integers q,r 
such that n = 4q+r where r € Ry is even; that is, r equals 0 or 2. Consider these 
two cases. 


Hint for Exercise There exist integers k € Z and r € Rs = {0,1,2} such 
that 


(4.86) n= 3k-+r. 
We compute that 
(4.87) n? = 3(3k? + 2kr) +r. 


Consider the following three cases: (1) r = 0, (2) r = 1, and (3) r = 2. We have 
for example in case (2) that n? = 3q +1, where q = 3k? + 2k. 
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Hint for Exercise We have that m? = 3q1 +11 and n? = 3q2 + re, where 
0 < T1,7T2 < 1. Thus 


(4.88) m? +n? =3(q1 +42) +71 +12, 
where 

(4.89) O<r+re <2. 
Continue. 


Hint for Exercise Consider the two case where n is even or odd. 


Hint for Exercise Let n be a square of an integer. Then there exist integers 
ke Zandr € Rs = {0,1,2,3,4} such that 


(4.90) n=5k-+r. 
If, for example, r = 4, then n? = 5g +1, where g = 5k? + 8r +3. 
Hint for Exercise The completed table follows: 


remainders of perfect squares after dividing by m 
0,1 


= 
Semntankww]s 
cogocoococsd 
oon 


Hint for Exercise [4.13 


We have the following table of elements of R,, that are not remainders of perfect 
squares: 


m | elements of R,, that are not remainders of perfect squares 
2 none 

3 2 

4 2,3 

5 2,3 

6 2,5 

7 3,5,6 

8 2,3, 5, 6, 7 

9 2,3,5,6,8 

10 2,3,7,8 
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Hint for Exercise |4.14 

Condition | m | Integers ruled out from being perfect squares 
2, 5, 8, 11, 14, 17, 20, 23, 26, 29, 32, 35, 38, 41, 44, 47, 50, 
53, 56, 59, 62, 65, 68, 71, 74, 77, 80, 83, 86, 89, 92, 95, 98 
2, 3, 6, 7, 10, 11, 14, 15, 18, 19, 22, 23, 26, 27, 30, 31, 34, 
2) 4 35, 38, 39, 42, 43, 46, 47, 50, 51, 54, 55, 58, 59, 62, 63, 66, 
67, 70, 71, 74, 75, 78, 79, 82, 83, 86, 87, 90, 91, 94, 95, 98, 99 
2, 3, 7, 8, 12, 18, 17, 18, 22, 23, 27, 28, 32, 33, 37, 38, 42, 
3) 5 43, 47, 48, 52, 53, 57, 58, 62, 63, 67, 68, 72, 73, 77, 78, 82, 
83, 87, 88, 92, 93, 97, 98 
2, 5, 8, 11, 14, 17, 20, 23, 26, 29, 32, 35, 38, 41, 44, 47, 50, 


(1) 3 


4) 6 53, 56, 59, 62, 65, 68, 71, 74, 77, 80, 83, 86, 89, 92, 95, 98 
3, 5, 6, 10, 12, 13, 17, 19, 20, 24, 26, 27, 31, 33, 34, 38, 40, 
(5) 7 41, 45, 47, 48, 52, 54, 55, 59, 61, 62, 66, 68, 69, 73, 75, 76, 80, 82, 


83, 87, 89, 90, 94, 96, 97 

5.3. 8. 8, 7, (0c 11; 13, 14, 15,18, 19, 91, 32, 9a, 96.97, 20 

30, 31, 34, 35, 37, 38, 39, 42, 43, 45, 46, 47, 50, 51, 53, 54, 55, 
(6) 8 | 58, 59, 61, 62, 63, 66, 67, 69, 70, 71, 74, 75, 77, 78, 79, 82, 83 

85, 86, 87, 90, 91, 93, 94, 95, 98, 99 

2, 3, 5, 6, 8, 11, 12, 14, 15, 17, 20, 21, 23, 24, 26, 29, 30, 32, 33, 
(7) 9 | 35, 38, 39, 41, 42, 44, 47, 48, 50, 51, 53, 56, 57, 59, 60, 62, 65, 66, 
68, 69, 71, 74, 75, 77, 78, 80, 83, 84, 86, 87, 89, 92, 93, 95, 96, 98 


12, 13, 17, 18, 22, 23, 27, 28, 32, 33, 37, 38, 42, 43, 47, 48, 52, 53, 
57, 58, 62, 63, 67, 68, 72, 73, 77, 78, 82, 83, 87, 88, 92, 93, 97, 98 


(8) 10 
The integers from 1 to 100 that are not ruled out as perfect squares by considering 
division by m for 3 <m < 10 are 
(4.91) 1,4, 9, 16, 25, 36, 49, 64, 81, 100. 

Each of these integers is a perfect square! So, for the first 100 positive integers, we 
can rule out all non-perfect square by considering division by m for 3 < m < 10. 
Hint for Exercise Let x and y be integers. Let P(n) be the statement that 
xz” — y” is divisible by x — y. 

Clearly P(1) is true since x— y is divisible by x— y. Suppose that P(k) is true. 
Then there exists an integer a such that 
(4.92) a® —y* = a(x —y). 

We compute that 


k+1 _ yhtl _ (gk+1 _ yk 


+ (a*y — y**) 


) 
a* (a —y) + (a* —y")y 
= o*(2—y) +a(e—y)y 
= (x* + ay)(x — y). 


Since x” + ay is an integer, this proves that «*+1! — y*+! is divisible by x —y. That 
is, P(k + 1) is true. 
The result follows from induction. 


y 
a 
4 
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Hint for Exercise [4.16] For each n € Z*, let S(n) be the set of primes that 
divide at least one of the integers a1, @2,...,@. Each S(n) is a finite set. By the 
Prime Factorization Theorem, $(1) is non-empty. Clearly, S(n) C S(n+1). We will 
prove that S(n) is a proper subset of S(n +1) for each n. That is, there exists an 
element of S(n+1) that is not an element of S(n). Suppose for a contradiction that 
S(n) = S(n +1). Let p be a prime that divides a,4,. Then p € S(n+1) = S(n). 
Since p € S(n), p divides a; for some 1 <i <n. Therefore gcd(an41,a;) > p> 1 
for some 1 <i <n, which is a contradiction. 

Now we have |$(n + 1)| > |S(n)| +1 for all n € Z*. This implies that there 
are an infinite number of primes. 


Hint for Exercise We follow the hint and we first prove by induction that 
n-1 
(4.93) [[%="-2 forn>1. 
i=0 
Check the base case. 
As our inductive hypothesis, suppose that k € Z* has the property that 


k-1 
(4.94) [fam -2 
1=0 


Show that 
Fey = (2? 7 +1 
= (F,)? —2F, +2. 
By multiplying by Fy, we obtain 


k 
(4.95) || 5% = (4)? - 24. 
1=0 


Finish the proof by induction of (493). 


Now we prove that the Fermat numbers are pairwise coprime. Let k < @ be 
non-negative integers. We have 


£-1 


Fp=|[A+2=2+h%- [] F&F. 
i=0 O<i<e—1, i4¢k 


Suppose for a contradiction that F;, and Fy are not coprime. Since Fy and Fy 
are both odd, this means that there exists an odd prime p dividing both Fi, and 
Fy. However, this is a contradiction since then the previous display implies that p 
divides 2. 


Hint for Exercise We prove this by induction. Check the base case. As 
our inductive hypothesis, suppose that k > 2 has the property that F;, has last 
digit equal to 7. By the calculation in the solution to the previous problem, we 
have that 


(4.96) Fyai = (Fy)? — 2Fy + 2. 
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Moreover, by our inductive hypothesis there exists an integer q such that 
(4.97) Fy, = 10g + 7. 
Show that 

Fy41 = 10(10q? + 12¢ + 3) +7. 


Hint for Exercise As a crude estimate, since 27 < 10 < 24, we have 
9300 < 4(100 — 9400. 

Hint for Exercise The gcd is 4. 

Hint for Exercise The gcd is 6. 

Hint for Exercise 4 = —95- 188 + 22-812. 

Hint for Exercise 6 = —49- 258+ 17-744. 

Hint for Exercise m = 8 and n = —39. 


Hint for Exercise Since p and q are distinct and prime, they are coprime. 


Hint for Exercise Observing that 12.2 = 24 and 5-5 = 25, we compute 
that 


(4.98) (—12)(5n + 2) + (5)(12n + 5) = 1. 

By Theorem [4.20] we conclude that gcd(5n + 2,12n + 5) = 1. 
Hint for Exercise We compute that 

(4.99) (—c)(an + b) + (a)(en+ d) = —bc+ ad = 1. 


Hint for Exercise [4.28] Suppose that a prime p divides the product ab. We have 
g := gcd(p, a) is a positive integer dividing p. Hence g = 1 or g = p. 
If g = p, then since g divides a, we have that p divides a. 
On the other hand, if g = 1, then by Theorem [4.23] we have that p divides b. 
We have proved that p divides a or p divides b. 


Hint for Exercise Let a,b,c be non-zero integers. The statement we 
want to prove is the contrapositive of the implication: If (a and b are coprime) and 
(a and c are coprime), then a and bc are coprime. This statement is true by Lemma 


[4.29] 


Hint for Exercise Let g = gcd(a,b). Then g divides both a and b. This 
implies that g” divides both a” and b”. We conclude from this that 


(4.100) g” <gced(a”, bd”). 
Now suppose for a contradiction that 
(4.101) g” < gced(a”,b”). 
We have that g” divides gcd(a”,b”). Thus there exists an integer k > 1 such that 
(4.102) gcd(a”, b”) = kg”. 
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Let p be a prime that divides k. Then g”p divides both a” and b”, which implies 
that p divides both (s) and (4) . This in turn implies that p divides both 4 


and a This proves that g is not the gcd of a and b, which is a contradiction. 
Hint for Exercise Show that m, and m2/g are coprime. 


Hint for Exercise We prove Corollary [4.34] by induction on k. 
The base case k = 1 is obvious. 


Suppose that k € Z* has the property that for every pairwise coprime positive 
integer m 1,7™2,..., Mx, if each of m1,mz2,..., mx divides an integer x, then their 
product m1m2---m, divides x. 


Now suppose that m,,m2,...,™41 are pairwise coprime positive integers 
such that each of m ,m2,...,M41 divides an integer x. Since then each of 
mM1,M2,...,M, divides x, we have that m,m2---m, divides x. Now since the 
my ,™Mg,.-.-,Mz+1 are pairwise coprime, we have 
(4.103) gcd(m img “Nk, Mp+1) =<: 


Since m4 1 also divides x, by Proposition |4.33| we have that the product 
(4.104) M1M2°+*Mk* MEE = ™MyMQ°++ M41 


divides x. This proves the inductive step. 

By induction, we are done. 

Finally, to be complete, we prove (4103) by induction. 

Suppose as an inductive hypothesis that k is a positive integer with the 
property that if m,,m2,...,mz,b are pairwise coprime positive integers. Then 
gcd(m ima2---mz,b) = 1. 

Let m1, ™M2,..., M41, b be pairwise coprime positive integers. By the inductive 
hypothesis, gcd(m ,mz2--+-m z,b) = 1. This and ged(mz41, b) = 1 imply that 


(4.105) 1 = ged((mymz +++ Mz) M41, 6) = ged(mMimy +++ Mz 44, 8). 


This proves the inductive step, so by induction we have proved (4103). 


Note that for (4.105) we used the fact that if gcd(a,c) = gcd(b,c) = 1, then 
gcd(ab,c) = 1. This is one of the lemmas in the book. 


Hint for Exercise Multiply the solution to Exercise by 7. 
Multiply the solution to Exercise [4.23] by 3. 


Hint for Exercise Use Theorem [4.38] 

Hint for Exercise We compute using a “big number calculator” that 
(4.106) 3987'? = 16134474609 751 291 283496 491970515151 715346481 
and 


(4.107) 4365'? = 47842181 739947 321332739 738982639 336 181640625. 
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Therefore 

(3987! + 4365!7)!/!? — 63976656 349698612616 236 230953 154487 896 987 1061/1? 
= 4472.0000000071. 

The answer is very close to, but not equal to, an integer! 


Next, we prove that 


(4.108) 39877? +4365!" 4 44727, 
Let n be an odd integer. Then there exist integers g and r such that 
(4.109) n=4q+r, 


where r equals 1 or 3. 

Case 1: r =1. In this case, 
(4.110) ni? =4s+1 
for some integer s. 

Case 2: r = 3. In this case, there is an integer s such that 

r* = 43 + 3* = 4(s + 20) +1. 
Thus, there is an integer t such that 
pi? = (r*)8 = 4t4+ 1. 

We have just proved that any odd integer to the power 12 when divided by 4 has 
a remainder of 1. 


From all of this, we conclude that the integer 3987!* + 43651? when divided by 
4 has a remainder of 2. However, 4472!? when divided by 4 has a remainder of 0. 
Thus 3987/2 + 4365!” cannot be equal to 4472!?. 


Hint for Exercise [4.36] Firstly, apply the Division Theorem to get a quotient 
qd and a remainder r’ withO <r’ <p. (1) Ifr’ <b, let r =r’. (2) If r’ > 6, let 
r =r’ —p. Continue to prove existence. To prove uniqueness, mimic the proof of 
the uniqueness part of the Division Theorem. 
Hint for Exercise (1) Show that p cannot divide ia for 1 <i < b. 

(2) We have 0<i+ 7 < 2b <p. So (i+ J)a cannot be a multiple of p. 
Hint for Exercise |4.38} When r; < 0, we have 0 < —r; < b. When r; > 0, 


we have 0 < r; < b. None of the —r;’s and r,;’s are congruent to each other, and 
therefore they are distinct. 


EEE —— 


Chapter 5 


Sets and Functions 


The &’s = {Ad, Kh, Qde, Je, 10h, Ide, Ste, 7 ho, Ce, She, Ade 3h, 2c}. 
The ’s = {AO, KO, QO, JO, 100, 9,84, 7,60, 50,40, 3, 20}. 
The 0’s = {AY, KY, QY, JY, 109, 99,89, 79,69, 59,49, 39, 29}. 
The @’s = {A@,K@, Qd,J&, 100,90, 80, 70,60,50,44,340,24}. 


Figure 5.0.1. Four 13-element sets whose union is a deck of cards, without Jokers (or thieves) 


®@ THE JOLLY ® 


Figure 5.0.2. A lonely joker. This work has been released into the public 
domain by its author, Trocche100 at Italian Wikipedia. 


Goals of this chapter: To introduce the language of mathematics: set theory. 
To understand general properties of functions. To relate functions to counting 
the number of elements in finite sets. To understand injections, surjections, and 
bijections. 


We begin with some fun examples of sets: a set of suits, with four elements: 
(5.1) {&,>, 9, a}. 
A set of suites, with three elements: 
{Bach Suite No. 3 in D major, Holst The Planets Suite, Grieg Peer Gynt Suites}. 
A set of sweets, with four elements: 
{Creme tangerine, Coffee dessert, Cool cherry cream, Coconut fudge}, 


not to mention a ginger sling with a pineapple heart, all after the Savoy Truffle|| 


1Rock trivia question: Why is this Beatles song, written by George Harrison, about Slowhand, 
a.k.a. Eric Clapton? 
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5.1. Basics of set theory 


Per its namesake, we start with: 


5.1.1. The notion of a set. By definition, a/set)|is a collection of objects. 

For example, Z, Q, R denote the sets of integers, rational numbers, and real 
numbers, respectively. Z* denotes the set of positive integers. We have also con- 
sidered the set EF of even integers and the set O of odd integers, as well as many 
other sets. 


Sets occur in everyday life, especially when we organize or categorize stuff. 


5.1.2. Elements of sets. We call the objects in sets If x is an 
element of a set A, then we denote this by 


ce A. 


So « € R is just another way of saying that xz is a real number. For example, 7 € R 
since 7 is a real number. 


If an object x is not an element of A, then we denote this by 


cE A. 
That is, x ¢ A if and only if not (« € A). For example, 3 ¢ Z since 3 is not an 
integer. We also have 7 ¢ Z. But 3 € Q since 3 is a rational number. And x ¢ Q, 


although this last fact is 


Figure 5.1.1. The element z is a “hare” inside A; i.e., 2 € A. Also, y is not 
an element of A; i.e., y ¢ A. 


5.1.3. Subsets and equality of sets. Now that we know what a set is, we want 
to understand relations between sets. 

We say that a set A is a'subset)of a set B, read as A C B, if x € A implies 
x € B. In other words, every element of A is an element of B. 

We say that two sets A and B are/equal, denoted A = B, if and only if AC B 
and B Cc A. Equivalently, A = B if and only if « € Asa € B. In other words, 
the elements of A are the same as the elements of B. 


Example 5.1. The set of positive integers is equal to the set of integers greater 
than or equal to 1. This is because there is no integer n satisfying 0 <n <1. 
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Observe that we have the inclusion of sets 
(5.2) ZCQ and QCR. 


This is because every integer is a rational number and every rational number is a 
real number. 

We say that a set A is a proper subset of a set B if A is a subset of B and 
AFB. 

It is easy to see that Z is a proper subset of Q since 4 is rational but is not an 
integer. 

By Theorem |3.45| which states that the square root of 2 is an irrational real 
number, we have that Q is a proper subset of R. 


We have the following basic property of the subset relation. 


Lemma 5.2 (Transitivity of the subset relation). If A C B and B C C, then 
ACC. 


Proof. Suppose that A C Band BCC. Let « € A. Since A C B, this implies 
xz € B. Now, since B C C, this implies 2 € C. We have proved that if « € A, then 
xéEC. That is, ACC. 


Do you notice any similarity between the proof of this lemma and the proof of 


Lemma [3.29] (Exercise 3.21)? 


5.1.4. Sets of sets: A primer. A set itself can be an element of another set. 
Consider the following set, which is different than the set in (6.1): 


(5.3) S := {The &’s, The }’s, The 0’s, The @’s}, 


where the elements The #&’s, The <’s, The (’s, and The @’s are given by Figure 
at the beginning of this chapter! The set S is a set with 4 elements, where 
each element is itself a set with 13 elements! For example, we have 


QVe TheY’s and TheO’seS. 


So, in set theory, it is perfectly fine to consider sets of sets, although there are 
some paradoxes lurking around which require any rigorous treatment to be well 
thought out. We will mention some of the issues in set theory, but for the most 
part we will sidestep a detailed treatment of the (commonly accepted) axioms of 
set theory. 


Exercise 5.1. Explain why QO ¢ S. 


Example 5.3. Let 2 = {2@, 29,2, 2é},...,2 = {A@, AO, A>, A&}. Then we 
have the set of sets T = {2,3,...,K, A}, which has 13 elements, each of which is a 
set of 4 elements since there are 4 suits. 
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5.1.5. Much ado about nothing: The empty set. 


Zero, zip, zilch, nada ... yada yada yada 


A fundamental, but somewhat trivial, set is an empty set|(. This is a set 
with no elements. In contrast to its name, the empty set appears ubiquitously in 
set theory. 


For the following, let us assume that a zebra is a well-defined mathematical 
notion. 


Proposition 5.4 (x € 9 implies your favorite statement, true or false). Let @ be 
an empty set. If x € 9, then = is a zebra. 


Proof. This is a true statement! This is because the hypothesis of the implication, 
x € 9), is false since @ has no elements. Any implication with a false hypothesis is a 
true statement. 


The idea of the proposition above implies that there is only one empty set: 


Theorem 5.5 (Uniqueness of nothing). The empty set is unique. 


Proof. Suppose 9 and §’ are two empty sets; i.e., each of these two sets has no 
elements. Then the implication x € @ implies x € 0’ is true since x € @) is false 
(because ) has no elements). Similarly, the implication 2 € §’ implies x € @ is true 
since 0’ has no elements. Hence 


(5.4) 0=0'. 

Exercise 5.2. (1) Let 0 be an empty set. Prove that for every set A, we have 
that 

(5.5) OCA. 


(2) Use this to give another proof of Theorem 


Since an empty set is unique, from now on we will refer to it as the empty set. 


Exercise 5.3 (Part of nothing is nothing). Prove that if a set A is a subset of 0, 
then A= 9). 


Question: What did the empty set say during an existential crisis? 
Answer: Why is there nothing rather than something? 


We say that a set A is non-empty if A 4 0. In particular, if A contains some 
element, call it x9, then A is non-empty. We also observe that if B is a non-empty 
set, then ( is a proper subset of B. 


Given the empty set @, we can define the set 


(5.6) {O}. 
This set is non-empty! Indeed, it has a single element, the empty set, which is 
something! That is, in this case, nothing is something. © We can also consider 
{{0}}, which is the set whose sole element is the set {0}. 

To make sure we distinguish between similarly appearing notations, we consider 
the following. 
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Solved Problem 5.6. For each of the following statements, either prove or dis- 
prove: 


(1) d€90. 
(2) Oe {O}. 
(3) 0c {O}. 


Solution. (1) This statement is false. The empty set has no elements, but the 
statement 9 € @ says that @ is an element of 0. 


(2) This statement is true. The set {0} is the set with one element, where the 
one element is 0. So, 0 € {@}. 


(3) This statement is true. This is because {} is a set and the empty set @ is 
a subset of any set. 


By the set {{@}} we mean the set with one element, where the one element is 
{@}. o 


To try to further confound matters, we present: 


Exercise 5.4 (Out of the void). For each of the following statements, either prove 
or disprove: 


(i) {0} Cc {0}. 
(ii) {O} © {O}. 
{O} € {{O}}. 

{0} C {{O}}- 

0c {{O}}. 

Oe {{O}}. 

5.1.6. Russell’s Paradox. We briefly discuss one of the paradoxes in the unre- 


strained consideration of sets of sets, as in 45.1.4) Russell’s Paradox, which mildly 
boggles the mind, is the following. Let 


) 
(iii 
(iv) 
(v) 

) 


(vi 


R be the set of all sets that are not elements of themselves, 


Figure 5.1.2. Bertrand Russell] (1872-1970). Dutch National Archives, Au- 


thor: Anefo, Creative Commons CC0 1.0 Universal Public Domain Dedication. 


208 5. Sets and Functions 


known as the “Russell set”. In other words, we define 
(5.7) R:={A:A¢ A}, 
where the object A to the left of the colon is a set. We have: 


(1) If A € R, then by the definition of R we have A ¢ A. 
(2) If A ¢ R, then by the definition of R we have not A ¢ A; that is A € A. 


Applying this to the set A = R, we obtain: 


(1) FRER, thn RE R. 
(2) If R¢ R, then RE R. 


Thus, we have proved 
(5.8) ReRsS RER. 


In any case, we have a contradiction. So we conclude that the Russell set, despite 
its name, is not a set. We discussed the oxymoronic Russell set as a cautionary 
tale, the moral being that rigorous set theory must find a way to disallow such 
self-contradictory, fantastic beasts. However, we will not go into the nuances of set 
theory, as all of the objects we will consider will be well-defined. 


5.1.7. Unions, intersections, and complements. Now we consider basic op- 
erations on sets. Let A and B be sets. We define their union) A U B to be the set 
whose elements are in A or B. In other words, 


AUB={x«:ax¢€Aorze Bh. 


That is, an object x is an element of AU B if and only if x is an element of A or x 
is an element of B. 
We define the [intersection] AM B of A and B to be the set whose elements 
are in A and B. In other words, 
ANB={«x : «¢€Aandze Bh. 


That is, an object x is an element of AM B if and only if x is an element of A and 
x is an element of B. Figure|5.1.3] visualizes the union and intersection of two sets. 


Example 5.7. Let EF be the even integers and let O be the odd integers. We then 
have 


(5.9) EUO=Z and EnO=%. 

Example 5.8. Given an integer m, let 

(5.10) mZ := {ma : « € Z} 

denote the set of multiples of m. If a@ and 6 are non-zero integers, then 
(5.11) aZM bZ@ = \em{a, }]Z, 


where lcm denotes the least common multiple. We leave this as an exercise for you 
to prove. 


For example, 
51ZN 68Z = 204Z. 
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A B AUB ANB 


Figure 5.1.3. Two sets, their union, and their intersection. 


When we fix a set U such that all sets in our discussion are subsets of it, we 
call U the universal set or ambient set or 


Caveat: By universal set, we do not mean the standard notion of 
in set theory, where in that case the universal set is the set of all objects, including 
itself. This actually leads to Russell’s Paradox. 


If we fix a universal set U, the of a set A (A is assumed to be a 


subset of U) is the set 

AP=l(pelseg A}. 
Figure visualizes a set and its complement. 
Example 5.9. Let Z be the domain of discourse. Then 
(5.12) E°=O and OC=E. 


Example 5.10. Let R be the domain of discourse. Then Q° is the set of irrational 
numbers. 


Example 5.11. Let Z be the domain of discourse, and let m be a positive integer. 
Then (mZ)° is the set of integers that are not multiples of m. 
Exercise 5.5 (Double negative). Show the following: 

(1) @t@EASear€ At. 

(2) (Ae)° = A. 


Exercise 5.6 (Contrapositive). For subsets of a universal set U, A C B if and 
only if BS Cc AS. 


Figure 5.1.4. A set A and its complement A°. 
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Solved Problem 5.12 (Subsets and complements). In Exercise you showed 
that for every set A and B, A C B if and only if B® C A®%. Explain how this 
is related to the contrapositive of an implication being logically equivalent to the 
implication. 


Solution. Let U be the domain of discourse. 


We have that A C B if and only if: for all « € U, x € A implies x € B. 

We have that B® C A° if and only if: for all « € U, x € B° implies x € A°. 
That is, for all x € U, not (x € B) implies not (x € A). This implication is the 
contrapositive of the implication in the previous paragraph. 


So, from the logical equivalence of an implication with its contrapositive, we 
see that A C B if and only if B® Cc A®. 


Exercise 5.7 (Returning the “complement”). Let U be the domain of discourse. 
For this exercise, do not use Lemma 6.13] below. But you may use that (C°)* = C 
for every set C. 

(1) Prove that a subset A satisfies A° = Q if and only if A=U. 

(2) Prove that a subset B satisfies B° = U if and only if B=9. 


Recall from Examples [5.7] and 5.9] that E UO = Z, ENO = 9, E° = O, and 


O° = E. This is an example of two sets partitioning a larger set. We can think 
of the universal set as a house and each of the sets as a room. The whole house 


consists only of the two rooms, and the two rooms have nothing in common. More 
generally, we have the following. 


Lemma 5.13. Let A be a subset of a universal set U. Then: 
(1) AUAS=U. 
(2) ANAS=9. 
Hence, A and A® partition U. 
Exercise 5.8. Prove Lemma[5.13] Do not use the De Morgan laws, which will be 


discussed below. 


5.1.8. The set difference. So far, we have introduced the operations on sets 
of unions, intersections, and complements. We now consider a derivative operation. 
Let U be the universal set, and let A and B be two sets. The set difference of A 
minus B is defined by 


(5.13) A-B:=ANB*. 
That is, A — B is the set of objects in A that are not also in B. Observe that 
(5.14) A-BCA and A-BCB*. 


We may visualize the set difference by Figure [5.1.5 
Example 5.14. For any set A, we have 
A-AS=A. 


The properties discussed in the next subsection will enable you to prove this fact 
easily. For now, we give a direct proof. Let « ¢ A— A®. Then a € A and a ¢ A‘. 
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A B 


Figure 5.1.5. The set difference A — B. 


Thus x € A. This proves that A— A° C A. Conversely, let « € A. Then a € (A‘°)°, 
which implies z ¢ A°. Hence « € A— A‘. The desired set equality follows. 


Exercise 5.9. Let U be the universal set, and let A and B be sets. Prove the 
following set equalities: 


(1) U-B=Be 
(2) A-O=A. 
(3) A-A=6. 
(4) 0-B=0. 
(5) A-U=9. 


5.1.9. Set operation properties. Given the set operations we have introduced, 
we can easily prove some basic properties. 


Lemma 5.15. For any sets A and B, 
(5.15) ANBCACAUB. 


Proof. Let x €¢ ANB. Then x € A and x € B. This implies that x € A. Thus we 
have proved that AN Bc A. 


Let « € A. Then x € Aorz € B. This says that c € AUB. Thus we have 
proved that A C AUB. 


Figure [5.1.6] visualizes (as) Venn diagrams) the set inclusions in (6.15). 


A B 


Figure 5.1.6. From this Venn diagram we can see that AN BC AC AUB, 
A-—BcCA,andA—BC B*. 
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The equality cases of the subset inclusions of the lemma above are characterized 
by: 


Lemma 5.16. 

(1) ANB=AS ACB. 

(2) A=AUB SBCA. 
Proof. (1) We prove the = direction of the biconditional, and we leave the = 
direction as an exercise. Suppose AN B = A. This implies that AC AN BC B. 


(2) Again, we prove the = direction of the biconditional, and we leave the = 
direction as an exercise. Suppose A = AUB. This implies that BC AUB C A. 


Note that Lemma [5.16{1) is characterized by the pink region in Figure [5.1.6] 
having no elements, whereas Lemmaf5.16|2) is characterized by the light blue region 
in Figure [5.1.6] having no elements. 


Exercise 5.10. Let A and B be sets. Prove: 


(1) ACBS ANB=A. 
(2) BCAS>A=AUB. 


This completes the proof of Lemma 
Note that, as special cases, we have 
(5.16) ANA=A, A=AUA. 
Exercise 5.11. Let A and B be sets. Prove that 
AUB=ANB S A=B. 


Regarding the interaction between complements and intersections and unions, 


the De Morgan laws are: 

Lemma 5.17. Let A and B be sets in a universal set U. Then 
(5.17) (AN B) = ASUB, 

(5.18) (AU B)® = ASN BS. 


Proof. We prove (5.17), and we leave (5.18) as an exercise. We have the following 
string of biconditionals: 


zé€(ANB) ex¢ ANB 
= not (cE ANB) 
= not («4 € Aand ze B) 
= not (« € A) or not (x € B) 
@=ax€A°orxe BS 
re AUB". 


Figure 8.6.3] visualizes De Morgan’s laws. 


Exercise 5.12. Prove (AU B)* = A°NB°. This proves (5.18) and hence completes 
the proof of Lemma [5.17] 
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Solved Problem 5.18 (De Morgan laws for logic and sets). Explain how the 
De Morgan laws in Lemma|5.17\ are related to the De Morgan laws in Solved Problem 


[3.32] 


Solution. We see the relationship by defining 

P(a) to be the statement € A and Q(z) to be the statement x € B. 
The law (AN B)* = A°U B® says 

xé€(ANB) ifandonlyif 2¢€ A°UB*; 
that is, 
not («¢€ Aandxze B) ifandonlyif (not # € A) or (not x € B), 
or in other words (think about this more abstractly), 
not (P(x) and Q(x)) if and only if (not P(x)) or (not Q(z)) . 

This last statement is one of the two De Morgan laws in Solved Problem B.32] 


Similarly, we see that the law (AU B)° = A°N B* is equivalent to the bicondi- 
tional 


not (P(x) or Q(x)) if and only if (not P(«)) and (not Q(z)) 
is the other De Morgan law in Solved Problem [3.32] 


5.1.10. How unions and intersections interact. We now consider properties 
restricted to only unions and intersections. It is rather easy to see that both unions 
and intersections are commutative; that is, 


(5.19) AUB=BUA, ANB=BNA. 
They are also associative: 
(5.20) (AUB)UC=AU(BUQC), (ANB)NC=AN(BNC). 
For this reason we often write the sets above just as 
(5.21) AUBUGZC, ANBNC. 
We also have the following |distributive properties; 
(5.22a) AU(BNC) =(AUB)N(AUQ), 
(5.22b) AN (BUC) = (AN B)U(ANC). 


We don’t bother to prove the properties above but rather leave the details to 
the interested reader. Don’t worry! We allow you to use these properties even 
though we haven’t proved them! You can use Figure [5.1.7] to visualize why these 
properties are true. 


Lemma 5.19. If A,, Ao, Bi, Bo are sets, then 
(5.23) (Ay U Ag) N (Bi U Bo) = (ALN By) U (A1N Ba) U (Ao Bi) U (Aa By). 
Exercise 5.13. Prove Lemma .19) 
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Figure 5.1.7. Venn diagram for the union of sets A, B,C. 


5.2. Cartesian products of sets 


We now discuss a set operation that impacts how we fundamentally view the uni- 
verse, classically, as Euclidean space. 


Let X and Y be sets. Their|cartesian product|is the set defined by 
(5.24) XxY:={(a,y): cE xX, yeY}. 


Here (x,y) denotes an ordered pair, so that (z,y) = (z,w) if and only if « = z 
and y = w. We denote 


(5.25) X? = Xx X. 

Example 5.20. Let X = {a,b,c,d} and Y = {1,2,3}. Then their cartesian 
product is given by 

XxY={(a, 1), (a, 2), (a, 3), (b, 1), (0, 2), (b, 3), (c, 1), (¢, 2), (c, 3), (d, 1), (d, 2), (d,3)}. 
We can picture the cartesian product as in Figure 5.2.1] 


(a,3) (b,3) (c,3) (4,3) 
(a,2) (b,2) (c,2) (4,2) 
(a,1) (1) (1) (@1) 


XxY 


Figure 5.2.1. Visualization of the cartesian product of X = {a,b,c,d} and 
Y =41;.2)3}. 


We can take the cartesian product of more than two sets. Given sets X,Y, Z, 
observe that there is a natural bijection 


(5.26) b:(X x Y)x Z5Xx(¥ x Z) 
defined by 
(5.27) b((a,y), 2) := (2, (y,z)) forte X,yeY, ze Z. 
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So, instead of distinguishing between the two versions of the triple cartesian prod- 
uct, we simply write either as X x Y x Z. 

Actually, instead of taking cartesian products two sets at a time and ap- 
pealing to the “associativity” of the cartesian product, we do the following. Let 


X 1, X9,...,X» be sets, where n is an integer greater than 1. Theirn-fold cartesian 
is defined by 


(5.28) ll% = X) x Xo ) re Xn = {(@1,@2,---,;%n) 125 € Xj for 1 <i< n}. 
i=1 

Here, (#1,%2,..-,;%p) is called an ordered n-tuple. We define two n-tuples 

(@1,%2,..-,X%n) and (yi, y2,---;Yn) to be equal if and only if x; = y; for alll <i< 

n. Given a set X, we denote 


(5.29) MP a NEN Ki KN, 
ec —<—<_— 
n times 


We can visualize triple cartesian products since we live in a 3-dimensional world, 
but it is a stretch to visualize cartesian products of 4 or more sets. 


Remark 5.21 (Thinking decimally). We can even consider 
For example, recall from (411i) that 
Rio = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9} 


is the set of remainders for division by 10. Let X; = Ryo for i € Zt. Then we can 
take the countably infinite cartesian product of Rio with itself; we define: 


(5.30)  (Rio)* = TJ] X= {! :Z* + (J Xi: f(i) € X; for allie a}. 


ieZ+ ieZ+ 
In other words, an element of (Rio)2” is an infinite sequence of digits 


FO), FQ fSly xs. 


How is this set similar to the unit interval [0, 1] of real numbers? How is this different 
from [0,1]? Hint: A real number may have more than one decimal expansion. For 
example, 0.1 = 0.09. 


5.2.1. Euclidean space. In calculus, we consider 1-, 2-, and 3-dimensional Eu- 
clidean spaces. The n-dimensional Euclidean space|can be concretely defined as 
(5.31) R” = {(1,22,...,%n): a; ER, 1 <i<n}. 

This is the n-fold cartesian product of R with itself. We also call R” Euclidean 
n-space. 

Sets may be endowed with additional structures, such as algebraic or geometric 
structures for example. Euclidean space naturally has a/vector space|structure (we 
assume that you are familiar with vector spaces) where vector addition is defined 
by: For x = (21, £2, eran ifn) and = (Yi, Y2, bs ies 
(5.32) xb y = (i+ yi, 2+ ya, +-+5 2a + Ya), 
and scalar multiplication is defined by: For c € R, 


(5.33) CX = (CX, C¥Q,..., CX). 
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Figure 5.2.2. A visualization of R? = Rx RxR. The dot at the intersection 
of the axes is the origin (0,0,0). Each axis represents a ray emanating from 
the origin. For example, the z-axis is the set {(0,0,z) : z > 0}. An element 
(a, b,c) is visualized. 


See the “Vector space” Wikipedia link for visualizations of vector addition and 
scalar multiplication. 


It is easy to see that Euclidean space, with this vector addition and scalar 


multiplication, satisfies the Observe that we can consider 


vector addition as a function from R” x R” to R”, and scalar multiplication as a 
function from R x R” to R”. We discuss functions in more generality in §5.3] below. 


5.2.2. Euclidean dot product and norm. Euclidean space is usually endowed 
with its naturally defined |Euclidean dot product, given by 
(5.34) XV = UY + Lay2 +++ + LnYn- 


As we will see below, this is a geometric structure in the guise of an algebraic 
structure. 


Lemma 5.22. The Euclidean dot product satisfies the following properties: 
(1) x-y=y-x. 
(2) (cx)-y =c(y-x). 
(3) (k+y):z2=x-at+y-z. 


Because of these properties, the Euclidean dot product is called a symmetric, 
form. Namely, property (1) is called symmetry, and in the presence of symmetry, 
properties (2) and (3) combined are called bilinearity. 


Exercise 5.14. Prove Lemma |5.22 


The norm (or absolute value) is defined by 


(5.35) |x| = Yeu = fo? tal te a2. 
With the norm, we can define the distance between two points as 
(5.36) dist(x, y) := |x —y|. 


We leave it to you, dear reader, to check that when n = 2, this definition of distance 
follows from the Pythagorean Theorem [3.4] 
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A more abstract way to define Euclidean n-space is as an n-dimensional [vector] 
[Space| with a dot product, We will not go down this route: 


“Because you have been down there, |Neo| You know that road. 
You know exactly where it ends. And I know that’s not where you want 
to be.” 


If, on the other hand, you wish go down this interesting and important route, we 
note that there are many excellent books on linear algebra. 


Exercise 5.15. Let r be a positive real number, and let A = [—r,r] x [—r,r] be a 
square (with side lengths 2r) in R?. Let B = {(z,y) € R? : a7 + y? < 2r?}. Prove 
that 


(5.37) ACB. 


Figure 5.2.3. A square peg in a round hole. 


5.2.3. Euclidean vector projection. 


Solved Problem 5.23 (Euclidean vector projection). Let w be a non-zero vector 


in R” and let v be a vector in R”. Derive the formula for the vector projection, of 


v onto w. 


Solution. The vector projection proj,,(v) of v onto w is 


(5.38) cw, 
where c € R is determined by the equation 
(5.39) (v—cw)-w=0, 


which says that the vector v — proj,,(v) is orthogonal (perpendicular) to the vector 
w (suggestion: draw a picture!). That is, 


(5.40) oo 
|w|? 
Thus the vector projection is 
; vw 
(5.41) proj,,(v) = Tw 


See Figure [5.2.4] for a visualization of the vector projection. 


Observe that in terms of the unit vector u := Tey? this says that 


(5.42) projw(Vv) = projy(v) = (v-u)u. 
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Figure 5.2.4. The vectors v and w and the vector projection of v onto w. 


5.2.4. The Cauchy—Schwarz inequality. We have the following fundamental 
inequality for vectors in R” regarding their lengths and dot products. 


Theorem 5.24 (Cauchy—Schwarz inequality). For any x,y € R”, we have 
(5.43) Ix-y| < |x| ly. 


Equality holds in (5.43) if and only if one of the vectors x or y is a multiple of the 
other, t.e., if and only if they are linearly dependent. 


Proof. If y = 0, then the theorem is easy to prove. So, without loss of generality, 
we may assume that y 4 0. Let \ € R be a real number to be chosen below. 
Whatever is, we have 


(5.44) 0 < |x —Ay/? 
= (x — Ay): (x — Ay) 
=x-x+Ay:-Ay—x-Ay-Ay:-x 
= |x|? + A? ly? — 2Ax-y. 
The vector projection of x onto y is equal to yey: Motivated by this, we choose 
poe”. 
= ye 
which is well-defined since |y| 4 0. With this choice of A, the quantity x — Ay in 


(6.44) is equal to the vector x minus its orthogonal projection onto the vector y. 
Furthermore, inequality (5.44) becomes 


2 
o<|xl?+ (=) ly|? 2E pee ec . ) : 


(5.45) 


Therefore 
(5.46) (x+y)? < |x/?ly|?. 
Inequality (6.43) follows from taking square roots of both non-negative sides. 
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Finally, we characterize the equality case. By the discussion above, equality 
holds in if and only if we have equality in (6.46). This, in turn, is equivalent 
to equality in (6.43). Now, equality in (6.44) holds if and only if either y = 0 or x 
is a scalar multiple of y. This completes the proof of the theorem. 


In what sense have we been an architect (in the sense of §3.1.4) for the proof 
above? 


Let 6 be the angle between non-zero vectors x and y. Since the vector projection 
of x onto y has length aT? we have 


xy 
Wi a EY 
(5.47) cos(@) = ——- = —_. 
Ix} [x/ ly! 
That is, 
= xy 
(5.48) 6 =cos! (=~) : 
Ix/ly| 


Alternatively, we can use the law of cosines to derive (6.47). We have a triangle 
in Figure [5.2.4] with side lengths |x|, |y|, |x — y| and angle @. By the law of cosines 
for this triangle, we have 


(5.49) Ix — yl? = |x|? + ly? — 2|x]|y| cos(4). 
Exercise 5.16. Rederive (5.47) from equality (5.49). 


A fundamentally important consequence of the Cauchy—Schwarz inequality is: 
Corollary 5.25 (Triangle inequality). For any x,y € R", we have 
(5.50) Ix+y| < [x] + lyl. 
Exercise 5.17. Prove Corollary Hint: Square both sides and expand. 

The reason why is called the triangle inequality is that it implies for 
every a,b,c € R”, 
(5.51) dist(a,c) < dist(a, b) + dist(b, c), 


where distance is defined by (5.36). We leave it to you, dear reader, to prove this. 
In other words, going straight from point a to point c is at least as short as going 
straight from a to b and then going straight from b to c. 


5.2.5. Planes. With respect to the vector space structure of R”, we can define 


special subsets. We say that a subset DL of R” is a (a.k.a. vector 


subspace) provided it has the property that if x,y € L and a,b eR, then 
(5.52) ax + by € L. 


Given x € R”, the line passing through x and the origin 0 is the set 
(5.53) ly = {tx:t ER}. 
Observe that if Z is a linear subspace of R” and if x € L, then @, Cc L. 


It turns out that if D is a linear subspace of R”, then L with the induced vector 
addition and scalar multiplication is itself a vector space of dimension between 0 
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Figure 5.2.5. A linear subspace is a “plane” passing through the origin. 


and n, inclusive. The only subspace of dimension 0 is {0}, and the only subspace 
of dimension n is R”. 


Exercise 5.18. Let ¢: R" > R be allinear function, Prove that 
(5.54) ker(@) := {x € R” : &(x) = 0} 


is a linear subspace. 


We say that a subset A of R” is an if there exists a € A such 


that the subset 
(5.55) Ag :={x—a:xe€ A} 
is a linear subspace of R”. 

We see that a linear subspace is necessarily an affine subspace since if D is 
a linear subspace, then Lo = L, where O is the origin. We won’t go into detail 
about the linear algebra of affine subspaces, but an affine subspace A of R” has a 
dimension d, where 0 < d < nis an integer. We may think of A as a “copy” of R?@ in 
IR”. In other words, A is a d-dimensional plane in R”. Now, linear subspaces must 
contain the origin (see Figure [5.2.5), whereas affine subspaces may not necessarily 
contain the origin. See Figure [5.2.6 


z 


x 


Figure 5.2.6. An affine space is a “plane” which may or may not pass through 
the origin. 
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Exercise 5.19. Show that, given a non-zero vector v € R” andce R, 
(5.56) {x € R" :x-v=c} 
is an affine subspace. 

Explain why it is a linear subspace if and only if c = 0. 


Lemma 5.26. A subset A of R" is an affine subspace if and only if for alla € A, 
the subset Ag = {x—a:x € A} is a linear subspace of R”. 


Exercise 5.20. Prove Lemma[5.26| Note that the difference is the replacement of 
the existential quantifier “there exists” with the universal quantifier “for all”. 


Lemma 5.27. [f L is a linear subspace of R” and if xo € R”, then 
(5.57) L+xo:= {x+xo:xeEL} 
is an affine subspace of R”. 


Exercise 5.21. Prove Lemma b.27]| 


5.2.6. Spheres. In addition to special linear subsets of Euclidean space, we 
have special non-linear subsets. 


The (n — 1)-dimensional |sphere of radius r € R* centered at the origin 0 in 
R” is defined by 


(5.58) Sr) = {ee R”: x) =r}. 


We also call $"~1(r) the (n — 1)-sphere of radius r. 
Writing a point in terms of its Euclidean coordinates as 


(5.59) MSH (ay We ae), 


Figure 5.2.7. A 2-sphere. The curves in the plane are the images under 
stereographic projection (see {5.2.7] below) of the curves in the sphere. From 


Virtual Mathematics Museum by the 3D-XplorMath Consortium, Public Do- 


main. 
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we have that 
(5.60) S°l(r) = {(a1,22,...,2n) ER”: a? +03 +--+ +02 = 7°}, 


where we denote x? := (2;)?. 


In low dimensions, we often use the indexless coordinates z,y,z,.... So the 
1-sphere, a.k.a. the |circle| of radius r is given by 
(5.61) Slr) ={(z,y) € R? a? +7? = 1°}, 
and the 2-sphere, a.k.a. the sphere, of radius r is given by 
(5.62) S7(r) ={(2,y,z) ER? sa? +y? +2? =r’. 


The unit (n —1)-sphere is defined to be the (n — 1)-sphere of radius 1. We use 
the notation 


(5.63) sr} = gr-1(1), 


5.2.7. Stereographic projection. An interesting and useful function defined 
on spheres is stereographic projection, which we now describe. Functions, in more 
generality, will be discussed in the next section. 

Let S”~! be the unit (n—1)-sphere as defined in (5.63). Let n = (0,...,0,1) € 
S$"! be the “north pole”. Let L be the hyperplane ((n — 1)-dimensional subspace 
of R”) defined by 


(5.64) EL={y €R":y-n=—-1}={(z,-1):2e R*® 1} =R™! x {-1]}. 


Then L is tangent to $"~! at the south pole s := —n = (0,...,0,—1). Note also 
that another way to express L is as the set of points x = (21,%2,...,2%n) in R” 
satisfying 7, = —1. 


We will define stereographic projection f from the sphere minus a point S”—~!— 
{n} to the hyperplane L. The idea is that given a point x on the sphere minus the 
point n, the unique line passing through n and x intersects the hyperplane L at a 
unique point, which we define to be f(x). 


Define stereographic projection f : $”~'— {n} > L by 
2x—(1+x-n)n 
(5.65) f(x) = 2 


1—-x-n 
for x € S"~!—{n}. See Figure[5.2.8] for a visualization of stereographic projection. 
In this picture, we have n = 3 and 


(5.66) feesne) = (Fe Pe). 


Observe that f(s) =s since s-n = —1. The equator of $"~! consists of those 
points x € S"~+ satisfying x -n = 0. For such points, f(x) = 2x —n. So f maps 
the equator (which is a circle of radius 1) to a circle of radius 2 in the hyperplane 


Solved Problem 5.28 (Stereographic projection from the sphere to the plane). 
Derive formula for stereographic projection. That is, show for any x € 
S"-1_{n} that f(x) defined by (5.65) is the unique point in the intersection of L 
and the line passing through n and x. 
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Figure 5.2.8. Visualizing stereographic projection. Meridians and parallels 
are mapped to lines and circles, respectively. From the Virtual Mathematics 


Museum by the 3D-XplorMath Consortium, Public Domain. 


Solution. Let us derive the formula for the stereographic projection f(x) of 
a point x. This is equal to the point y in the hyperplane L which also lies on the 
unique line determined by n and x. So 


(5.67) y =(1-t)n+tx, 
where ¢ is determined by the equation y € L; that is, 
(5.68) -l=y-n=(1-t)+tx-n. 
Hence 
2 1+x-n 

: = ——— h 1 — : 
(5.69) t =< so that t (ae 
We then conclude that 

1l+x-n 2 

5.70 =y= 
( ) P(x) =y 1l—-x-n 1l—-x-n 


Equation (5.65) follows easily from this. 0 


Exercise 5.22 (Stereographic projection). Let f : S"~'—{n} —> L be stereographic 


projection. 


(i) Prove that f(x)-n = —1 for allx € S"~!—{n}. This proves that f maps 
into L as advertised; i.e., f is well-defined. 


(ii) Prove that the function g: L > S"~! —{n} defined by 
4y + (|y|?-—1)n 
(5.71) aly) == vee 


ly? +3 
is the inverse of the function f. 


For simplicity, let n = 3. Then 
(4u, 4v, u2 + v2 — 4) 
uz+u2+4 


(5.72) g(u,v,-1) = 
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By the exercise, we also have that f is a bijecton from $?—{n} to R? x {—1}, which 
in turn is bijective with R?. So, in a sense, we can think of $? as being “equal to” 
R? with a point added, which we may call “infinity”, 00. We write S? & R?U {oo}. 
So oo € R? U {oo} corresponds to n € S?. On the 2-sphere S?, if we start at the 
south pole s = (0,0,—1) and travel along any meridian, then we end up at the 
north pole (0,0,1) after travelling distance 7. Correspondingly, via stereographic 
projection, if in the plane R? we start at the origin (0,0) and travel along any ray, 
we can go an infinite distance, and in the limit we reach the added point sof 


5.2.8. Real projective space. Related to spheres are projective spaces. 
Exercise 5.23. A line through the origin in R"*! may be described as 
(5.73) {iv :teER}, 
where v € R"*! — {0} is a non-zero vector. 

Prove that any line L passing through the origin in R"*+ intersects the unit 


sphere S” at exactly two points, which are antipodes of each other. 


Exercise 5.24. Define RP” to be the set of lines passing through the origin in 
R"*!. Define the function f : S" + RP” by f(x) is the unique line passing 
through the origin in R"*! containing x. Prove that such a function indeed exists 
and for each L € RP” we have |f~\(L)| = 2. 


RP” is called the (n-dimensional) real projective space. We call RP? the 
real projective plane. 


Figure 5.2.9. A This is a model for the real projective plane RP?. 
Fun fact: RP? co blersediin R® but cannot belembeddedlin R?. From 
the Virtual Mathematics Museum by the 3D-XplorMath Consortium] Public 


Domain. 


2So0 we say that stereographic projection does not preserve distances. But, amazingly, it preserves 
angles. Alas, we do not prove this here. 
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5.3. Functions and their properties 


Sets without functions are like goalposts without balls. There is not much point 
playing the game without them! 


We have so far been discussing functions from a working point of view. Now 
we will go a bit into the rigorous foundations for the notion of function. 


5.3.1. What is a function? When you think about a function, probably some- 

thing like y = sin(a) comes to mind. Other functions are polynomials, and you’ve 
probably spent some time considering quadratic functions such as f(x) = ax? +be+ 
c, where a,b,c are real numbers. But what is the general definition of a function? 
Definition 5.29. Given two sets X and Y, alfunction| f from X to Y assigns to 
each element in X a unique element in Y. So, to each x € X, the function f assigns 
a unique element of Y, which we denote by f(x). 
Example 5.30. Let X = {a,b,c} and let Y = {1,2,3}. Defining a function f 
from X to Y is equivalent to specifying elements f(a), f(b), and f(c) in Y. For 
example, we can define f by f(a) = 3, f(b) = 1, and f(c) = 3. See Figure 5.3.1] for 
a visualization of this function. 


Historical note. The song {ABC” by The Jackson 5 (not to be confused with 
Maroon 5) was a #1 hit in the year 1970. Part of the lyrics is: “A B C, one, two, 
three, ...”. 


Figure 5.3.1. Visualizing the function f in Example[5.30 
The set X is called the domain) of f and the set Y is called the |;codomain| of 
f. We denote the function usually by 
f:X 7 Y. 
We also use the following notation: 
xry f(x). 
Here we can think of x as the input|and f(x) as the output. 
Exercise 5.25. Again let X = {a,b,c} and let Y = {1, 2,3}. 
(1) If we define f(a) =1 and f(b) = 2, does this define a function f :X > Y? 
(2) If we define g(a) = 1, g(b) = 2, and g(c) = 4, does this define a function 
g:X ~Y? 
(3) If we define h(a) = 1, h(b) = 2, h(c) = 3, and h(d) = 1, does this define a 
functionh: X > Y? 
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5.3.2. Examples of functions. An important, albeit somewhat trivial looking, 


function is the identity function. Given a set X, the identity function of X is 


the function Ix : X — X defined by Ix (x) = x for all x € X. 


na n n 


y =Ir(x)=2 —— 


Figure 5.3.2. The identity function Ig of the set R viewed in two ways. Left: 
The graph of Jp is a diagonal line in R? = R x R. Right: The arrows show 
that Ig maps each real number to itself. 


Many functions are defined by a “formula”. 


Example 5.31. Consider the function y = x?. What we really mean by this is 


the function f from R to R defined by f(a) = x?. We will call f the squaring 
function. 


Definition 5.32. Let X be a set. A|sequence|of points in the set X is a function 
f:Z° 3X. 


For example, a sequence of real numbers is a function f : Zt — R. However, 
we usually write a sequence using the notation a,,@2,a3,... or {a@,}°,. The 
correspondence between the definition and this notation is given by the equality 


Qn, =f(n) forne Zt. 
For example, consider the harmonic sequence ea ee This is equivalent to the 
function f : Zt — R defined by f(n) = +4. 
Remark 5.33. We should not confuse a sequence {a,,}°2, with the set {a,:n € 


Z*}. Indeed, the latter is the image of the sequence as a function; see 


below. For example, {(—1)"}°2, is an alternating sequence —1,1,—1,1,-—1,1,..., 


whereas {(—1)":n € Zt} = {-1, 1} is a set with two elements. 


Example 5.34. Let S$ be an infinite subset of Z+. Associated to S, we can define 
a strictly increasing sequence f : Zt + Z* by induction as follows. 


(1) f(1) is the minimum element of S', which exists by the well-ordering prin- 
ciple. 
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Figure 5.3.3. The harmonic sequence {A}o2, viewed in two ways. Left: As 
a function mapping inputs (positive integers) to their reciprocals. Right: The 
colored dots represent the graph of the function n +> 1/n. 


(2) Given a positive integer k for which f(1), f(2),...,f(k) are defined, we 
define f(k + 1) to be the minimum element of the set S — {f(1), f(2),..., f(A}. 
Note that this minimum element exists because S' is infinite. 


As an easy example of all of this, the function associated to the subset EN Zt 
of even positive integers is the function defined by f(n) = 2n. 


5.3.3*. Generating functions. Given a sequence {a,,}°,, its|generating func 
[tion is defined to be the power series| 


(5.74) x ayn". 
n=0 


So the generating function of a sequence is not a function per se, for the radius of 
convergence of the series may be finite or even zero. But generating functions are 
useful ways to encode the information, and hence properties, of a sequence. 


Example 5.35. Consider the constant sequence {1}°29. Its generating function 
is the power series 


(5.75) we 
n=0 


If |z| < 1, then this series converges to ;+.. 


Let {fn} be the Fibonacci sequence (defined in {2.5), where fo := 0: 
(5.76) 0,1,1,2,3,5,8, 13, 21,34, 55, 89,.... 


Its generating function is 


(5.77) ne” 
n=0 
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We now prove that (5.77) is the power series for the {rational function’ 
(5.78) f(x) = ———_ 
Define, as in (2.116), 


(5.79) a= = 


2 2 
which satisfy a+ 6 = 1 and a@ = —1. From this we see that we can factor the 
denominator of (6.78) as 
(5.80) 1-2-2? =(1-az)(1—- 82). 


Using partial fractions, we can simplify the rational function with a quadratic de- 


nominator in (5.78) as the difference of two rational functions with linear denomi- 
nators: 


ea ec 1 1 1 
; l-av-2 VW5\l-ar 1-62)" 
This leads to the geometric series 
i is 1 
= non fi uae 4 
(=a Dee or aap 
1 ~ 1 
——— = Bra" for |x| << —. 
1— Bx » a [3| 
We have 
(5.82) a pei oS ee 


n=0 n=0 
for |a| < Te where we used Binet’s formula (2.117). 


In the argument above, we used Binet’s formula to prove that 432 
SS fnx”. Conversely, we can prove this last formula to give a new derivation of 
Binet’s formula. To see this, let 


(5.83) Fee fe => faery a 
n=0 n=1 


n=2 


where the second two equalities follow from fp = 0 and f; = 1. We then compute 
using the Fibonacci number recursive definition (2.99) that 


F(a) =a+ yee + fn—2)2” 


n=2 


Co Co 
-1 —2,,2 
=@o+ 5 fn—10" r+ S fn—20” x 
n=2 n=2 
ioe) co 
=x£+ ) fran + ) fre” a? 
n=1 n=0 


=2+aF(2)+2°F(z). 
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From this it follows that 
(5.84) F(x) = 


l—-x-<22° 


x 


From seeing this new derivation of Binet’s formula using a generating function, we 
can imagine the general usefulness of generating functions. 


5.3.4. Equality of functions. We say that two functions f : X — Y and 
g:Z— W are equal provided their domains are equal, their codomains are equal, 
and all of their values are equal, that is, provided: 

(1) X=ZandY=W. 
(2) Forallae X =Z, f(x) =g(a4) CY =W. 
Example 5.36. Let X be aset and let f : X — R bea function. Define g: X — R 
by 
(5.85) g(x) = (-1)*f(a) for ce X. 
Then f = g (as functions). 


5.3.5. New functions out of old functions. Given a function f : X — Y, there 
are few simple things we can do to the function to create a new function. In other 
words, just as we have operations on sets to create new sets, we have operations on 
functions to create new functions. 


5.3.5.1. Images. The |image| of a function f : X — Y is defined by 
Im(f) ={f(v): aE X} CY. 
The terminology range is also used for image. 


When we think about real-valued functions of a single variable, we often picture 
their graph. For example, consider the sine function f : R > R defined by f(x) = 
sin xz. Its graph looks like a wave. Indeed, the periodicity of the sine function, that 
is, f(~@ +27) = f(z) for all « € R, means that the graph of sine repeats itself with 
period 27. 


Let sin denote the sine function and let cos denote the cosine function. Then 
(5.86) Im(sin) = [—1, 1] = Im(cos). 

If sq: R > R is the squaring function defined by sq(a) = x”, then 
(5.87) Im(sq) = R. 


Indeed, for every y € R=, we have sq(,/y) = (./y)” = y. Moreover, for every x € R, 
we have that sq(x) > 0. 


5.3.5.2. Restrictions and extensions. Let A C X. We can define the|restriction| 
of f to A by 


(5.88) fla: Az X, 
where 


(5.89) fla(z) = f(x) for ve A. 
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For example, we can restrict the sine function sin : R — R to the subset 
[—7/2, 7/2]. On this subset, sin is (strictly) increasing since its derivative sin’ (x) = 
cos(x) is positive for x € (—7/2, 7/2). See Figure 5.3.4] 


Figure 5.3.4. The graph of the restriction of the sine function to the closed 
interval [—7/2, 7/2] is in red. 


Suppose, on the other hand, X C B. Suppose g: B—- Y satisfies 
(5.90) g(a) = f(a) forxe X. 


Then we say that g is an extension of f. Note that in this case f = glx. 

Besides restricting and extending the domain, we can restrict and extend the 
codomain: Suppose that Y Cc C. Then we can define h : X > C by h(x) = f(x) 
for « € X. Note that h is well-defined since f(x) € Y implies that f(x) € C. 

If D is a set satisfying Im(f) C D C Y, then we can define k : X > D by 
k(a) = f(a) for « € X. Here, k is well-defined since for each « € X we have 
f(x) € Im(f) Cc D. In particular, we have the restriction k : X — Im(f) defined 
by k(x) = f(x) for a € X. 

So, to summarize, without any essential change to how the function is defined, 
we can shrink the domain or enlarge the codomain. We can shrink the codomain 
provided it is to a set that contains the image. If we enlarge the domain, we have 
to specify how we define the function outside its original domain. 

5.3.5.3. Compositions. Let 

f[: xX 7 Y 
be a function. Then, given 7 € X, we have that f(x) € Y. So if we have another 
function 

g:Y >Z, 
we can then take g(f(a)) to obtain an element of Z. This combined process is the 
composition] of the functions f and g. That is, we define 

gof:xX 92 
by 
go f(x)=g(f(a)) forxe x. 


Remark 5.37. For the composition to be well-defined, we just need that the image 
of f is contained in the domain of g. Note that in this case, by either enlarging the 
codomain of f or restricting the domain of g, if necessary, we can make them equal 
as sets. 
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xX Y Z 


Figure 5.3.5. The function f maps from Batduck to Superduck, and the func- 
tion g maps from Superduck to Starduck. Their composition go f maps from 
Batduck to Starduck. From “‘The duck pond’: showcase of TikZ-drawn an- 
imals/ducks” Stack Exchange answer by samcarter_is_at_topanswers.xyz. Li- 
censed under Creative Commons Attribution-Share Alike 4.0 International Li- 
cense (https://creativecommons.org/licenses/by-sa/4.0/deed.en). 


Example 5.38. Define a: R” > R” by 


(5.91) a(x) =—-x for xe R”. 

Then 

(5.92) aca = Ipn, 

where Ign is the identity function of R”. Indeed, we calculate that 
(5.93) (0 a)(x) = a(a(x)) = o(—x) = —(—-x) =x. 


The function a is called the/antipodal map! It is an example of an linvolution| 


that is, a is its own inverse; see §5.4.4] below. 


Exercise 5.26 (Mirror, mirror on the wall). Let u be a unit vector in R”. Prove 
that the function f : RR" + R” defined by 


(5.94) f:x7x-2(x-u)u 
is an involution. 


Can you figure out (geometrically) what kind of function this is? Hint: Firstly, 
what are the fixed points of the function? By definition, a point x € X is a fixed 
point of a function f: X > X if f(a) =a. 


An example from calculus class is: f(x) = 2? 


functions have domain and codomain R. Then 
go f(a) = sin(x), fog(x) =sin?(2). 


Caveat: The sine function has often been confused with Lombard Street, © 
Here is another example. Define g : Z — {0,1} by 


0 if n is even, 
(5.95) ain) = { 


and g(y) = siny. Both of these 


1 ifn is odd 
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y f(c) = sin*(2) 

t x 
+ 7 20 
y f(a) = sin(a? 
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Figure 5.3.6. The compositions of the sine and squaring functions. The order 
of composition makes a difference! 


(g is the characteristic function of the subset of odd integers in the set of integers), 
and for each positive integer k define f, : Z— Z by 


(5.96) fx(n) = 
We leave it to the reader to show that 
(5.97) gofg=g forallkeZt. 


Exercise 5.27. Prove go f, =g in (5.97). 


Some basic properties of composite functions are given by: 


Theorem 5.39. Suppose that f: X ~ Y,g:Y > Z, andh: Z > W are 
functions. Then: 

(1) (Associativity) (hog)o f=ho(gof). 

(2) (Identity function) folx =f=TIyof. 

Exercise 5.28. Prove Theorem [5.39] 


5.3.5.4. Graphs. We are accustomed to visualizing functions by their graphs, 
at least for real-valued functions of one or two variables. 


Formally, the |graphjof a function f : X > Y is defined by 
(5.98) Gy ={(a, f(z)): cE XFCXxY. 
Equivalently, 

Gr ={(z,y): 2 EX, y= flx)}. 
Example 5.40. Define the function 
cos(x 
(5.99) f(a,y) =In (=) 


on the largest subset of elements (a, y) in R? for which the expression on the right- 
hand side is well-defined. Call D the domain of f. The graph of f is the (graphical) 
surface in R? = R? x R defined by 


(5.100) Gy= { (sain (=) : (x,y) € p} 
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In what way does the domain D of f remind you of an (infinite) chessboard? Hint: 
A ratio is well-defined if and only if the denominator is non-zero. The logarithm of 
a number is well-defined if and only if the number is positive. 


Figure 5.3.7. Scherk’s surface. Wikimedia Commons, author: Erminia Nac- 
carato (Miriane), licensed under GNU Free Documentation License, Version 
1.2 or any later version. 


The surface Gy is known as |Scherk’s first surface, This is an example of a 
a concept in differential geometry related to |soap bubbles 


Exercise 5.29. Find the domain of the function defined in Example (5.40) That 
is, find the largest subset of R? on which the expression f(x,y) given by (6.99) is 
well-defined. 

5.3.6. The floor and ceiling functions. The 
(5.101) xr |x| 


of a real number input x is defined to be the greatest integer less than or equal to 
x. That is, 


(5.102) |x| :=sup{n €Z:n< a} =max{n€ Z:n< zc}. 
The ceiling function 
(5.103) rey [a] 


of a real number input x is defined to be the least integer greater than or equal to 
x. That is, 


(5.104) [x] :=inf{n€ Z:n>a}=min{neZ:n> zc}. 
We leave it as an exercise to graph the ceiling function. 


The fractional part (or decimal part) of a non-negative real number is defined 
by 


(5.105) frac(a7) := a — [a]; 
that is, 
(5.106) |x| + frac(x) = z. 


The functions above are nicely understood by drawing their graphs. 
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Figure 5.3.8. The graph of the floor function. 
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Figure 5.3.9. The graph of the fractional part function. 


5.4. Types of functions: Injections, surjections, and bijections 


For any function f : X — Y, simply because it is a function, for any elements x1 
and x9 of X, 


(5.107) if f(ai) A f (wo), then 21) 4 x. 


This is simply because: If x; = x2, then by the definition of function we have that 
f (a1) = f (x2). The preceding statement is the contrapositive of this statement. 


5.4.1. Injections, surjections, and bijections. 


5.4.1.1. Surjections. For a function, is every element of the codomain the image 
of some element of the domain? Not necessarily, but if it is, then we have the 
following. 


Definition 5.41. We say that f : X — Y is|surjective if for every y € Y, there 
exists « € X such that f (2) = y. We also call f a surjection. 


In other words, a function f is surjective if and only if f takes all values in Y. 
Equivalently, the image of f is Y: 


(5.108) Im(f) = Y. 
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Examples. (1) The (linear) function f : R > R defined by f (”) = 2x — 3 is 
surjective. Indeed, for every y € R, there exists x € R such that y = 2x —3; namely 


— yt3 
L=*y- 


(2) The function g : R > R defined by g (x) = x? is not surjective. For example, 
for —1 € R there does not exist  € R such that x? = —1. 


Figure 5.4.1. For example, the function f : X — Y defined by f(a) = 2, 
f(b) =1, f(c) = 2, f(d) = 3 is a surjection. 


Solved Problem 5.42. Define f :Z* + Z* by 


+ ifn is divisible by 18, 
n- ifn is not divisible by 13. 


Prove that f is surjective. 
Solution. Let k € Zt. Then n = 13k is divisible by 13. Therefore 


(5.109) f(m) = 7 =. 


This proves that f is a surjection. 


Exercise 5.30. Show that any cubic function f : R > R, defined by f(a) = 
ax? + bx? + ca +d, where a,b,c,d € R with a £0, is surjective. Hint: Take limits 
as x — too and use the Intermediate Value Theorem from calculus. 


Exercise 5.31. Let Y be a set and let f : Z —> Y be a function satisfying the 
following properties: 


(1) f is a surjection, 
(2) f(2n) = f(2n+1) for alln € Z. 
Let E C Z denote the set of even integers. Define the function 
g: EY, where g(k)=f(k) fork e E. 


Prove that g is a surjection. Hint: You may wish to consider two cases. Can you 
draw a picture to visualize f and g? 
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5.4.1.2. Injections. For a function, can two different elements in the domain 
have the same image? Possibly, but if we require not, then we have the following. 


Definition 5.43. We say that f : X — Y is|injective if x,,72 € X with 1; 4 x2 
implies f (11) 4 f (x2). We also call f an injection. 


Note that, equivalently, f : X — Y is injective if 21,72 € X with f(a) = 
f (2) implies 21 = x2. Also observe that the condition in Definition for a 
function to be injective is the converse of (5.107). 


Example 5.44. (1) The above function f : R > R defined by f (x) = 2x — 3 is 
injective. Indeed, if 71, x2 € R satisfy 2x, — 3 = 2x2 — 3, then 22%, = 2x2, which 
implies 71 = 2. 


(2) The function g : R > R defined by g (x) = x? is not injective. For example, 
for —1 4 1 we have (—1)” = 12. 

For the sets X and Y in Figure[5.4.]] can you convince yourself that there does 
not exist an injection g: X > Y? 

On the other hand, for the sets X and Y in Figure[b.4.]] there exists an injection 
h:Y — X. For example, we can define h by h(1) = b, h(2) = a, and h(3) = d. 
Exercise 5.32. Define g : Zt — Z by g(n) = (—1)"n. Prove that g is injective, 
but not surjective. 


Exercise 5.33. Define f : Zt > Z* by 
(5.110) re { n if n is even, 


n+l : ; 
"> ifn is odd. 


Is f injective? Is f surjective? For each k € Z*, what integers does the set f—'(k) 
consist of ? 


Exercise 5.34. Let X and Y be disjoint sets. Suppose that f : X — Y and 
g:Y¥ +X are injections. Prove thath: X UY + X UY, defined by 
(5.111) h(a) = f(x) if xe X and h(x) =g(2) if cEY, 
is an injection. 
Exercise 5.35. Let m € Z* and leta € R,,—{0} =N»_1. Define fa: Rm 7 Rm 
by 
(5.112) fa(z) = r(az), 
wherer: Z— R,, is the remainder function. 

(1) For which a € R3 — {0} is fa: Rg > Rg an injection? 

(2) For which a € Ra — {0} is fa : Ra > Ra an injection? 

(3) Suppose that a € Rm, — {0} satisfies gcd(a,m) > 1. Is fa: Rm 4 Rm an 
injection? 

(4) Suppose that a € Rm — {0} satisfies gcd(a,m) = 1. Is fa: Rn 4 Rm an 
injection? 


Exercise 5.36. Answer the same questions as in the previous exercise except with 
“injection” replaced by “surjection”. (Is it easier to answer these questions now 
that you have done the previous exercise?) 
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5.4.1.3. Bijections. When does a function provide a one-to-one correspondence 
between the elements of the domain and the elements of the codomain? In the 
following case. 


Definition 5.45. f:X > Y is if f is surjective and injective. We also 
call f a bijection. 


Example 5.46. (1) The above function f : R > R defined by f (x) = 2x — 3 is 
bijective since it is both surjective and injective. 


(2) Of course, g : R — R defined by g (x) = 2? is not bijective since it is neither 
surjective nor injective (to be not bijective it only needs to be not surjective or 
not injective). On the other hand, h : R2 —> R@ defined by h(x) = 2? is bijective. 


Exercise 5.37. Define f;,: R— R by fx(x) = x*. Determine whether each of the 


following functions is a surjection, injection, and/or bijection: 


(1) fr, where k is an odd positive integer, 


(2) fr, where k is an even positive integer. 


Now let f : X — Y bea function. Then we can restrict the codomain Y to the 
subset Im(f) and define the function 


(5.113) f:X 7 In(f) 


by f(x) := f(x) for all 2 € X. The reason why this function is well-defined is 
simply because for all  € X we have f(x) = f(x) € Im(f). We leave it to you, 
dear reader, to show that f is a surjection. In particular, if f is an injection, then 


f is a bijection. 


5.4.2. The pre-image. 


Question: What do you call it when you fire pre-images at your enemy? 


Answer: A |preimage attack 


Let f : X + Y bea function. Let B be a subset of Y. The |pre-image of B 
is the set of elements in X whose values are in B. That is, 


(5.114) f-'(B) ={2@ eX: f(z) € Bh. 
For example, f-'(Y) = X since f(x) € Y for all x € X. If y € Y, then we denote 
(5.115) fy) = fy) = {a eX: f(a) = y}- 


So the pre-image of a subset of the codomain is the set of elements of the 
domain whose images are elements of the subset of the codomain. 

Another way to say that f is a surjection is: For all y € Y, the pre-image 
f—'(y) is non-empty (i.e., consists of at least one element). 

Another way to say that f is an injection is: For all y € Y, the pre-image 
f~*(y) consists of at most one element. 

Another way to say that f is a bijection is: For all y € Y, the pre-image f~'(y) 
consists of exactly one element. 
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Example 5.47. Define the function g : Z > {0,1} by 
0 if n is even, 
1 ifn is odd. 


We then have that g~1(0) is the set E of even integers and that g~1(1) is the set O 
of odd integers. 


Example 5.48. Define h : Z — Z by h(n) = 2n. If k is odd, then h~1(k) = 0. If 
k is even, then h~!(k) = k/2. 


Example 5.49. For the function f : X — Y in Figure the pre-image of 2 is 
f-*(2) = {a,c}. 


g(n) = 


lhe mG) we 


Figure 5.4.2. Since a and c are the only elements in X that map to 2, we 
have f—!(2) = {a,c}. 


Solved Problem 5.50. Let f: X — Y be a function, and let y1 and y2 be distinct 
elements of Y. Prove that f~'(y1) A f~*(y2) = 9. 


Solution. Suppose for a contradiction that there exists x € f~!(y,) A f~+(ye). 
Then x € f~'(y) and x € f~'(y2). This implies that f(x) = y, and f(x) = yp. 
Therefore, y; = ye, which is a contradiction. 


Solved Problem 5.51 (Pre-images and the subset relation). Let f: X > Y be a 
function, and let B, C By CY. Prove that f~!(B,) Cc f~*(Be). 


Solution. Let x € f~'(B,). Then f(x) € B,. Since B, C Bo, this implies 
that f(a) € Bg. Therefore x € f~1(B2). 


Exercise 5.38. Define fy: R — R by f,(x) = v*® fork € Z*. Explain how the 
pre-image set f, '(y) (e.g., its|\cardinality| |f;,‘(y)|) depends on y € R and whether 


k is even or odd. 


Note that this exercise is more about understanding what the pre-image of a real 
number for a power function is in various cases rather than about giving rigorous 
proofs. 


Exercise 5.39. Let f : X > Y be a function. Suppose that B and C are subsets 
of Y such that BNC =9. Prove that 


(5.116) f BaF =o. 
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Exercise 5.40. Let f, be the n-th Fibonacci number. Define g: Z* — {—1,1} by 
(5.117) g(n) = (-1)**. 

Describe g~1(1) and g~\(-1). 

Exercise 5.41. Given m € Z, letr: Z— Rm, be the remainder function defined 
by (414). Describe the pre-image 

(5.118) r-*(a) 

foraeZ. 


Exercise 5.42. Let f: X ~ Y be a function. Now suppose that B and C are any 

subsets of Y. Can you find and prove an “identity” for f-'(B)N f—!(C)? Namely, 

is there a different expression that this is equal to? 

Exercise 5.43. Let f: X > Y be a function. Suppose that B is a subset of Y. 
(1) Show that 


(5.119) f(f-\(B)) CB. 
(2) Describe under what circumstance 
(5.120) f(f-(B)) = B. 
(3) Complementarily, when is f(f~'(B)) a proper subset of B? 
Exercise 5.44. Let f : X > Y be a function. Suppose that A is a subset of X. 
(1) Show that 
(5.121) Ae {> F(A), 
(2) Describe under what circumstance 
(5.122) A= f71(f(A)). 


5.4.3. Compositions of injections, surjections, bijections. Injections, surjec- 
tions, and bijections have the following nice properties regarding their compositions. 


Theorem 5.52. 
(1) The composition of two injections is an injection. 
(2) The composition of two surjections is a surjection. 


(3) The composition of two bijections is a bijection. 


Proof. Let f: X — Y and g: Y — Z be two functions. 


(1) Let f and g be injections. Suppose that x1, x22 € X are such that go f(a) = 
go f(a). This says that g(f(a1)) = g(f(x2)). Since g is an injection, this implies 
that f(vi1) = f(#2). Now, f being an injection implies that 7; = rg. We have 
proved that go f is an injection. 

(2) Let f and g be surjections. Let z € Z. Since g is a surjection, there exists 
y € Y such that g(y) = z. Now, since f is a surjection, there exists « € X such 
that f(x) = y. We conclude that go f(x) = g(f(x)) = g(y) = z. We have proved 
that go f is a surjection. 

Part (3) follows from parts (1) and (2). 
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Exercise 5.45. Let f : X — Y be an injection such that f(X) C B, where 
BcY. Letg: Y > Z be a function with the property that if y1,y2 € B satisfy 
g(y1) = gly2), then y1 = ya. Prove that go f : X > Z is an injection. (Again, 
drawing a picture may be helpful!) 


5.4.4. Bijections and inverses. We say that a function f : X — Y is in- 
vertible if there exists a function g : Y + X which “undoes what f does”; that 
is: 


For all « € X, g(f(x)) =a and for ally € Y, f(g(y)) =y. 
We call g the inverse of f. 


For example, let a € R — {0} and let b € R and define the function f: RR 
by f(x) = ax+b. Then we can find the inverse of f as follows. Write the equation 
y = ax +0; that is, y is the value of f at z. Then we solve for x: x = yo which is 
well-defined since x £ 0. That is, if an input x has value y under f, then x = yt 


So we define g: R > R by g(y) = yt It is easy to check that then g is the inverse 
of f. 


The following says that being a bijection and being invertible are equivalent. 


Theorem 5.53. Let f: X > Y. Then f is invertible if and only if it is a bijection. 
Furthermore, if it is invertible, then its inverse function is unique. 


Proof. Invertible implies bijection. Suppose that f : X — Y is invertible, so that 
it has an inverse g: Y > X. 

(1) f is injective. Suppose that x1,2%2 € X are such that f(x) = f(v2). Then 
x1 = 9(f(x1)) = o(f(x2)) = x2. This proves that f is injective. 

(2) f is surjective. Let ye Y. Let = g(y) € X. Then f(x) = f(g(y)) = y. 
This proves that f is surjective. 


Buyection implies invertible. Suppose that f : X — Y is bijective. 

Let y € Y. Since f is surjective, there exists « € X such that f(x) = y. Since 
f is injective, 2 is unique (exercise: understand why). We then define g(y) = z. 
By definition, 

«= gly) = 9(f(@)). 

This equation is true for all x €¢ X. (Subtle explanation for the last sentence: This 
is true for all y € Y, where x = g(y). So given x € X, we let y = f(x), so that 
x = g(y).) 

On the other hand, given x € X, let y = f(x). Since x is the unique element 
of X with f(x) = y, we have that g(y) = x. Thus 


y = f(x) = f(g(y)) 


for ally € Y. 


Example 5.54 (Inverse trigonometric functions). An important use of inverse 
functions is to define inverse trigonometric functions. Consider the sine function 
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xX 


Figure 5.4.3. This illustrates a one-to-one correspondence between two 3- 
element sets X and Y. Correspondingly, we have bijections (that are inverses 
of each other) f : X — Y and g: Y — X defined by f(a) = 2, f(b) = 3, 
f(c) =1, and g(1) = ¢, g(2) = a, g(3) = b. 


sin: R- R. This function is neither injective nor surjective. We can easily make 
it surjective by restricting the codomain to the image and defining 


sin: R > [—1, ]]. 


(Prove that this function is surjective by using the Intermediate Value Theorem.) 
To make the function injective, as in §5.3.5.2]we can restrict it to a suitable interval; 
namely, we consider the restriction 


sin : [—7/2, 7/2] > [-1, 1]. 
Indeed, since sin’(x) > 0 for all x € (—7/2,7/2), we have that sin is a strictly 
increasing function on [—7/2, 7/2] (the Mean Value Theorem gives a rigorous proof 
of this). For simplicity, we have used the same notation for the restricted functions 
as for the original function. So, sin : [—7/2,7/2] > [—1,1] is a bijective function. 
Thus, by Theorem [5.53] sine has an inverse: 


sin~* : [—1,1] 3 [—2/2, 7/2]. 


s f- (2) = sin“ (x) 


Figure 5.4.4. The graph of the inverse sine function. 
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Similarly, the restriction of cosine, 
cos : [0,7] > [-1, 1], 
is bijective. Thus we have the inverse function 
cos’ : [—1,1] > [0,7]. 
We leave it as an exercise to graph the inverse cosine function. 


Example 5.55 (The square root function). Consider the bijective function f : 
R= —+ R@ defined by f(x) = x?. This function is bijective and its inverse is the 
function g : R@ — R= defined by g(x) = Vz. 


Exercise 5.46. Let 


Tv T 
(5.123) StnZ={o+knikeZh. 


Explain why the tangent function tan : R— (z + nZ) — R, when restricted to the 
open interval (—5, 5), has an inverse function. 
What other open intervals yield similar inverse functions? 


How do the graphs of the various inverse tangent functions compare? 


5.4.5. Injections, surjections, and bijections between finite sets. In this sec- 
tion we consider functions between finite sets from the point of view of counting. 
We will take a less rigorous approach, usually omitting proofs, since the results are 
especially intuitive. 

5.4.5.1. The cardinality of a finite set. 


Definition 5.56. Given a set X, if there is a bijection from N,, = {1,2,3,...,n} 


to X, then we say that X has cardinality|n and we write |X| =n. If |X| =n for 
some n > 0, then we say that X is|finite! Otherwise, we say X is infinite} 


Simply put, the cardinality of a finite set is the number of (distinct) elements 
it has. In the definition, we have made this notion more formal by choosing the 
“model set” N,, of n elements. A set X is “equivalent” to this set from the viewpoint 
of cardinality if there exists a bijection between X and N,. 


Lemma 5.57. Suppose that Y is a finite set and B is a subset of Y. Then B is a 
finite set and |B| < |Y]. 


That is, the number of elements of a subset of a finite set is less than or equal 
to that finite set. For example, any subset of N,, has cardinality at most n. 

5.4.5.2. Injections between finite sets. The domain of an injection has no more 
elements than the codomain. 


Theorem 5.58. If f: X + Y is an injection between finite sets, then 
(5.124) |X| < |Y]. 

In particular, for any injection from N,,, the codomain has cardinality at least 
n. And for any injection to N,,, the domain has cardinality at most n. The basic 


reason the theorem above is true is that the image set f(X), which is a subset of 
the codomain Y, has the same cardinality as X. 
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By taking the contrapositive of the statement of this theorem, we obtain the 
following, which is referred to as the pigeonhole principle 


Corollary 5.59. Let X and Y be finite sets. If |X| > |Y| andf:X 3 Y isa 
function, then f is not an injection. 


Intuitively, if the domain has more elements than the codomain, then the do- 
main has more elements than the image, which means that there must exist two 
distinct elements which map to the same element. 

5.4.5.3. Surjections between finite sets. The codomain of a surjection has no 
more elements than the domain. 


Theorem 5.60. Let X and Y be finite sets. If there is a surjection f: X > Y, 
then |X| > |Y]. 


In particular, for any surjection from N,,, the codomain has cardinality at most 
n. And for any surjection to N,,, the domain has cardinality at least n. 
The contrapositive of this is: 


Corollary 5.61. Let X and Y be finite sets. If |X| <|Y| and f:X > Y, then f 
is not a surjection. 


The basic reason for the theorem above is: For any function between finite sets, 
the cardinality of the image is at most the cardinality of the domain, and in the 
case of a surjection, the image is equal to the codomain. 

5.4.5.4. Bijections between finite sets. Since bijections are one-to-one corre- 
spondences, we have the following. 


Theorem 5.62. Let X and Y be finite sets. If f : X + Y is a bijection, then 
|X| =|]. 


Proof. This is a direct consequence of Theorems [5.58] and [5.60] 


In particular, for any bijection from N,, to N,,, we must have that n =m. 
If the domain and codomain of a function have the same number of elements, 
then being injective is the same as being surjective. 


Theorem 5.63. Let X and Y be finite sets with |X| = |Y|. Then f: X > Y is 
an injection if and only if f : X + Y is a surjection. 


Intuitively, we can think of things as follows. Let f : X — Y and define 
f(X) ={f(a) : « € X} (another notation for f(X) is Im f), the image of f. Then 


(5.125) lf (X)| < |X], 
and since f(X) C Y we also have 
(5.126) If AOS IY. 


If f is an injection, then |f (X)| = |X], so that |X| < |Y| by (126). 
If f is a surjection, then f(X) = Y, so that |f (X)| =|Y]|. Hence |Y| < |X| by 


(5.125). 
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5.5. Arbitrary unions, intersections, and cartesian products 


Let X be a set (called the ambient set), and let J be a set (called the index set). 


A family of subsets of X indexed by I is a function 
(5.127) A: I> P(X). 


We consider an indexed family of subsets as a collection of sets and we usually 
denote it by 


(5.128) { As hier: 
The {union is defined by 
(5.129) |) Ai = {a € X : die I such that « € Aj}. 
wel 


For example, if X is any set, then 


LU {x} =x. 
cExX 
The lintersection| is defined by 
(5.130) (Avis {ee X: Viel, ce Aj}. 
i€l 
Example 5.64. Let the ambient set be R. Define A; := [—i, i], and define B; = 
[—1/i,1/i], for ie Z*. Then 
(5.131) LJAc=R, () Bi = {0}. 
ie ie] 


Exercise 5.47. Prove the facts in Example [5.64| 
Exercise 5.48. Give an example of a collection {A;}iez+ of non-empty subsets of 
R satisfying the following properties: 

(i) Ajai C A; for allie ZY. 

(ii) Miez+ Ac = 9. 


Example 5.65. Let f : X — Y be a function. Consider the family of subsets of 
X indexed by Y defined by 


(5.132) A:Y + P(X), 

where 

(5.133) A(y) = f~"(y)- 

Then 

(5.134) Ur @=x 
yeY 


is a disjoint union. 


5.5. Arbitrary unions, intersections, and cartesian products 245 


Let {X;}ier be an indexed family of sets. We define its cartesian product| 
(5.135) I[% 


ict 


to be the set of functions 


(5.136) f:I>aUx% 
tel 

satisfying 

(5.137) f(a) © Xi. 


In the case where the index set J is infinite, we will assume the: 


Axiom of Choice, Let {X;}:e7 be an indexed family of non-empty sets. Then 
there exists a 


(5.138) choice : I (J X; 

wel 
satisfying 
(5.139) choice(i) € X; for all i € J. 


With this assumption, we see that if all the sets X;, i € I, are non-empty, then 
their cartesian product is also non-empty. 
Let us see how this definition compares with the n-fold cartesian product def- 


inition in the case where J is a finite set. Without loss of generality, assume that 
I =N,, where n € Z*. Define the function 


(5.140) F: ][ Xi: X1 x X2x--+x Xn 
iENy, 

by 
(5.141) F(f) = (f(1), F(2),--.5 F(m)). 
We leave it as an exercise to show that the inverse of F is given by the function 
(5.142) G:X1x X.x---x Xn |] 

tENn 
defined by 
(5.143) G(@1,%2,.--,2n) = fx; 


where fx : Nn + Ujen,, Xi is given by 
(5.144) fx(i) = 43. 


Thus F' and G are bijective. For this reason we consider the two definitions of the 
cartesian product of n sets to be equivalent. 
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5.6*. Universal properties of surjections and injections 


In this section, we consider what may be considered as a more advanced topic, 
“universal properties”. This perspective is useful in abstract algebra and its related 
fields. 


We have the following elegant, but abstract, characterization of injective func- 
tions. 


Theorem 5.66. Let f: X — Y be a function. Then 


f is an injection <= there exists a function g: Y — X such that go f =TIx. 
The function g is called alleft inverse| of f. 


Proof. We assume that X 4 Q; otherwise it is tautological. 


(<) Suppose that there exists a function g : Y + X such that go f = Ix. Let 
21,22 € X be such that f (a1) = f (x2). Then g (f (@1)) = g (f (x2)), just because 
g is a function. Since go f = Ix, this says that x1 = x2. We have proved that f is 
an injection. 


(=) Suppose that f is an injection. We need to define the left inverse g. For 
this purpose, we choose an element rp € X. Let ye Y. 

Case 1. y € Imf. Since f is an injection, there exists a unique x € X such 
that f (a) = yl'| Define g(y) =a. 

Case 2. y ¢ Imf. Define g(y) = x. (It doesn’t matter how we define g in 
this case.) 


Now that we have defined g, we prove that go f = Ix. Let x € X. Then since 
f (x) € Im f, we have g(f (x)) =x by Case 1. This proves go f = Ix. 


Theorem |5.66]may be summarized by the following commutative diagram 


x 
(5.145) GS 
¥Yoeo x 


The arrow — denotes that f is injective. The arrow --» denotes that g is the 
function posited to exist. By the diagram being commutative, we mean that both 
ways of getting from X to X are equal; namely, go f = Ix. 


The surjection universal property is: 
Theorem 5.67. Let f : X — Y be a function. Then 
f is a surjection =< there exists a functionh:Y — X such that foh=TIy. 
The function h is called a right inverse of f. 


Exercise 5.49. Prove Theorem |5.67 


3Details: (i) Existence. Such an « exists by the definition of y € Im f. 
(ii) Uniqueness. Suppose x’ € X is also such that f(a’) = y. Then f (x’) = f (x). Since f is an 
injection, x’ = a. 
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Figure 5.6.1. Schematic picture of an injective function f: X > Y. 


Figure 5.6.2. Picturing a left inverse g: Y + X of f. On f(X), g undoes 
what f does. On Y — f(X), g can be defined arbitrarily; e.g., given wo € X, 
we can define g(z) = wo for all z € Y — f(X). 


Theorem [5.67)may be summarized by the following commutative diagram: 


_ 
(5.146) bg, as Ee 
x= yy 


The arrow —» denotes that f is surjective. 
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Hint for Exercise What are the four elements of S? Is QU one of these 
elements? 


Hint for Exercise What can you say about the implication 7 €0>a2¢€A 
for every object x? 
Hint for Exercise Let A be a subset of 0. Let x € A. Use that A C @ and 
continue to get a contradiction. 
Hint for Exercise 

(i) We actually have equality. 
(ii) {0} € {0}. What is the sole element of {0}? Is {0} equal to this element? 
(iii) What is the sole element of { {0} }? 
(iv) { {0} } has exactly two subsets. Is {@} one of them? 
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(v) What property does the empty set have regarding the subset relation? 
(vi) Again, the set on the right-hand side has exactly one element. What is it? 


Hint for Exercise We prove both statements by a string of biconditionals. 
Start with 


x€As not( not (a € A)). 


What is the definition of x € A°? 


Hint for Exercise [5.61 The statement AC B means that x € A> a € B. What 
is the contrapositive of this implication? 


Hint for Exercise [5.7 (1) (<) Suppose that A = U. Let x € A®. Then « ¢ A. 
Use that A = U and that U is the universal set to get a contradiction. 


(=) Suppose that A° = 0. Let x € U. Clearly x ¢ 0. Since A° = 9, this implies 
that « ¢ A°. Continue. 


Hint for Exercise (1) It suffices to prove that U C AU A®. Consider two 
cases: x € Aand a € A. 


Hint for Exercise [5.9] (1) U-B=UN B= B. 
(2) A-9 = ANOS =ANU=A. 


Hint for Exercise [5.10} (1) Suppose that A C B. Prove that AN BC A and 
ACANB. 


Hint for Exercise Prove (<=) and (=), where in the latter case you may 
wish to use Lemma 


Hint for Exercise Take the proof of (6.17) and interchange M and U and 
“and” and “or”. 


Hint for Exercise Use the distributivity property for the union and inter- 
section of sets (5.22). 


Hint for Exercise [5.14} (1) We start the calculation by 
XY = LY + Layo ++++ + LnYn- 


Continue. 


Hint for Exercise Let (x,y) € A= [-1r,r] x [-r,r]. Then —r <a <r and 
—r<y<r, so that |a| <r and |y| <r. Continue. 


Hint for Exercise Use that 
(5.147) Ix — yl? = xl? + ly? - 2x-y. 


Hint for Exercise [5.17] Show that 
(5.148) Ix+y|? < |x| +lyl?. 


Hint for Exercise Let x,y € ker(¢), and let a,b € R. Using that ¢ is a 
linear function, compute 


(5.149) é(ax + by). 
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Hint for Exercise Use that ¢(x) := x- v is a linear function. Also observe 


that 
n, _ = n. CV _ 
{xeER ixvad={xeR (x-&)-v= \ 


= {y+ Ser iy-voh. 


Iv? 


Hint for Exercise Since the universal quantifier “for all” is stronger than 
the existential quantifier “there exists”, we have the < direction of the biconditional 
being true. 


Proof of the => direction: By definition, we have that there exists b € A such 
that 
Ap = {x—b: xe A} 


is a linear subspace of R”. 
Now let a € A. Show that 


(5.150) A= Ae, 
Hint for Exercise [5.21] Let 
(5.151) A:=L+xo ={x+xo:x ED}, 


where L is a linear subspace of R” and xg € R”. 
Show that 


A_x, ={y—-xo:yE AS =HL. 
Hint for Exercise [5.22] (i) We start the calculation by 


x—(x-n)n a 


f(x) n= 


Continue, while using that |n| = 1. 


1-x-n 


(ii) One way to answer this part is for y € L to show that one can uniquely 
solve for s in the equation |x|? = 1, where x = (1 — s)n+ sy. This will derive the 
formula for the inverse function g of f. 

Another way to see that g is the inverse of f is to make two calculations. Firstly, 
letting 


_ 4y + (ly? -1)n 


5.152 — = : 
(5.152) x= gly) = 
show that 
ly? —5 8 2(lyl? — 1) 
~ fy? +3’ so that ee ea PP rae 2 
Using this, show that 
f(g(y)) =y.- 


This will show that g is a right inverse of f. 
Secondly, show that g is a left inverse of f. 
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Hint for Exercise The intersection of the line {tv : t € R} with the unit 
sphere $” is the set of all tv, where ¢ € R is such that |tv| = 1. Using that v is 
assumed to be non-zero, we see that |t| = 1/|v|. Continue. 


Hint for Exercise It is easy to show that the function exists since x € S” 
implies that it is distinct from the origin, and for every two distinct points there 
exists a unique line containing the two points. Continue the explanation. 


Hint for Exercise None of these define functions from X to Y, for different 
reasons. 


Hint for Exercise The calculation that 
f(f(x)) = f(x — 2(x-u)u) =x 


is rather straightforward. This function is a reflection about the hyperplane 


(5.153) H :={x € R":x-u=0}. 
Observe that if x € H, then 
(5.154) f(x) =x-—2(x-uju=x. 


So the function f fixes points on the hyperplane H. Now, let x € R”. Then the 
projection 7 of x onto H is given by 


(5.155) a(x) = x—(x-u)u. 


To show that the function f is a mirror image, we add to m(x) the vector 1(x) — x 
to obtain 


w(x) +(x) —-x=x-—2(x-u)u. 

Draw some pictures to explain what this shows about the function f. 
Hint for Exercise [5.27] We have 
(5.156) (9° fx)(n) = 9(fx(n)) = g(n*). 
Consider the cases where n is even and where n is odd. 
Hint for Exercise [5.28] (1) We compute that 

((hog) 0 f(a) =(hog)(F(a)) = A(G(F(a))). 
On the other hand, 

(ho (go f))(x) = h((go f)(x)) = h(g(f(2))). 
Therefore (ho g)o f =ho(gof). 


Hint for Exercise Since the domain of the natural logarithm function is 
the set of positive real numbers, the domain D of f is the set 


cos(x) 
sat) > 


Show that the domain D is the union of open squares (that is, the interior of squares 
not including their boundaries) of side lengths 7 centered at the points 


(5.158) (Q2m7,2nm7) and ((2m+1)7,(2n+1)7), where m,n € Z. 


(5.157) D= (ew) ER: 
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Hint for Exercise Since a £ 0, we have a > 0 ora < 0. We assume a > 0, 
and leave it to you to work out the similar case where a < 0. We have 


(5.159) lim f(x) = +00, 
whereas 
(5.160) Jim F(x) = —00. 


Let yo be any real number. By (6.159), there exists b € R such that f(b) > yo. 
On the other hand, by (6.160), there exists a € R such that f(a) < yo. Apply the 
Intermediate Value Theorem. 


Hint for Exercise Let y € Y. Since f is a surjection, there exists m € Z 
such that f(m) = y. 
Case 1: mis even. That is, m € E. So, in this case, 


g(m) = f(m) = y. 
Case 2: m is odd. Then m = 2¢+ 1 for some integer €. So 
g(20) = f(20) = f2l+1) = f(m) =y. 


We have proved that g is a surjection. 
Hint for Exercise Suppose that m,n € Z* satisfy g(m) = g(n). Then 
|g(m)| = |g(n)|. Continue to prove that g is injective. 

Suppose that g(n) = 1. Derive a contradiction. 


Hint for Exercise [5.33} This function is surjective, but it is not injective. 


Hint for Exercise Suppose that 21, z2 € XUY are such that h(z,) = h(z2). 


Case 1: z1 € X. Then h(z2) = h(z1) = f(z1) € Y. Since X and Y are disjoint, 
this implies that z2 € X (if z2 € Y, then h(z2) € X, which is a contradiction since 
X and Y are disjoint). Thus we have that 


f (22) = A(z2) = f(a). 
Since f is injective, this implies that z1 = 22. 


Case 2: z, € Y. Then h(z2) = h(z1) = g(z1) € X. Since X and Y are disjoint, 
this implies that z2 € Y. Thus we have that 


g(22) = h(z2) = g(z1). 
Since g is injective, this implies that z; = z2. 
We have proved that h is an injection. 


Remark. A variant on the proof above is to split the two cases instead as to 
whether h(z1) = h(z2) is (1) an element of X or (2) an element of Y. In the first 
case one shows that z1,z2 € Y and uses that g is an injection, and in the second 
case one shows that z 1, Z2 € X and uses that f is an injection. We leave it to the 
reader to fill in the details. 
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Hint for Exercise [5.35}) (1) m = 3. The functions f; and f are injections. 

(2) m = 4. The functions f; and fs are injections. The function f2 is not an 
injection. 

(3) No. 

(4) Yes. 
Hint for Exercise As we will prove in a later chapter, for functions be- 
tween finite sets of the same cardinality, they are injections if and only if they are 
surjections. So the answers for this exercise are the same as the answers for the 
previous exercise. 
Hint for Exercise |5.37 


(1) If k is an odd positive integer, then f, is a bijection (and hence an injection 
and a surjection). 


(2) If & is an even positive integer, then f; is neither an injection nor a surjec- 
tion, and hence not a bijection. 


Hint for Exercise [5.38} Firstly, if k is a positive odd integer, then f, : R > R is 
a bijection with inverse function given by 


(5.161) (fe) *(y) = 9". 
What does this say about f, '(y) for each y € R? 


Secondly, suppose that & is a positive even integer. Consider the three cases 
y <0, y=0, and y > 0. 


Hint for Exercise Suppose that x € f-!(B)N f-1(C). Show that f(x) € 
BOC. Derive a contradiction. 


Hint for Exercise [5.40| The Fibonacci sequence has the pattern “odd, odd, 
even”. 


Hint for Exercise For a € Z, the pre-image 
(5.162) r_'(a)={at+tqm: qe Z}. 


Hint for Exercise Based on our solution to the previous exercise, we 
propose the identity 


(5.163) f(B)n fC) =f 7 (Bc). 
Show that 

(5.164) (BN fC) c f-'(BNC) 
and 

(5.165) f"(Bne) cf (@)nf(C). 


Hint for Exercise [5.43] (1) Let y € f(f~'(B)). Show that y € B. 
(2) Suppose that f(f~1(B)) = B. Let y € B. Show that y € Im(f). 
Conversely, suppose that B Cc Im(f). Let y € B. Show that y € f(f~+(B)). 


(3) We have f(f~!(B)) a proper subset of B if and only if f(f~!(B)) # B, 
which by part (2) is equivalent to B ¢ Im(f). 
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Hint for Exercise (1) Let x € A. Then f(x) € f(A). Thus z € f~1(f(A)). 
(2) We have x € f~1(f(A)) if and only if f(x) € f(A). So A = f-1(f(A)) 
means the same thing as 
x € Aif and only if f(x) € f(A). 
In particular, if « ¢ A, then f(x) € f(A). 
Hint for Exercise [5.45) Let 71,22 € X be such that (go f)(r1) = (go f)(x2). 
Then g(f(71)) = 9(f(%2)). Since f(X) C B, we have 


f (x1), f(2) € B. 


By our hypothesis on g, this implies that f(%1) = f(a2). Now, since f is an 

injection, we have that x; = xr. We conclude that go f is an injection. 

Hint for Exercise The tangent function has the property that it is strictly 
TT 


increasing on the open interval (—5,4) (open means that the interval does not 
include its endpoints). In fact, 


(5.166) lim tan(#) = +00, lim tan(a%) = —oo. 
xn /2 a>—n/2 

Apply the Intermediate Value Theorem. 

We also have that the derivative 

d 

(5.167) tan(x) = sec?(x) > 0 
for x € (—5, 5). What can you conclude from this? 

The analogous facts hold on any open interval of the form 
(5.168) (a = > ka + =) . 
where k € Z. 


The (infinite number of) graphs of the various inverse tangent functions are 
stacked on top of each other, where any given inverse is a “distance” a above the 
inverse right below it (and so of course a “distance” 7 below the inverse right above 
it). 

Hint for Exercise [5.47 (1) Let x € R. Then there exists a positive integer i 
such that i > |a|. 

(2) Let x be a non-zero real number. Then there exists a positive integer 7 such 
that 1/2 < Ja]. 

Hint for Exercise There are many examples! 

For instance, we may define 
(5.169) Aj ={x@eEF: a> i}, 
where F is R, Q, or Z. 

Another example is 
(5.170) Aj ={m@eEF:0<2< 1/t}, 
where F is R or Q. 
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Hint for Exercise [5.49] (<) Let y € Y. By assumption, there exists a function 
h:Y +X such that foh=TIy. We have 


(5.171) y =Iy(y) = (fo h)y) = f(A). 
Thus, there exists « € X, namely x = h(y), such that f(x) = y. This proves that 
f is a surjection. 

(=) Now suppose that f is a surjection. Let y € Y. Since f is a surjection, 
the pre-image set f~'(y) is non-empty. Choose « € f~'(y) and define h(y) = a. 
This defines a function h: Y + X. We compute that for every y EY, 


(5.172) (foh)(y) = f(h(y)) = f(@), where x € f(y). 
Since x € f~1(y), f(x) =y. Therefore 
(5.173) (foh)(y) = f(x) =y=lyly). 


We have proved that foh = Iy;i.e., h is a right inverse of f. 


Chapter 6 


Modular Arithmetic 


Figure 6.0.1. A “modulo 7” number wheel. Can you explain what this wheel 
represents? (It’s not a dartboard. ©) 


Goals of this chapter: To understand a powerful concept in number theory: 
congruence, which is based on division. To solve linear congruence equations. To 
prove an important result in number theory: Fermat’s Little Theorem. To see its 
application to cryptography. 
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6.1. Multiples of 3 and 9 and the digits of a number in base 10 


Modular arithmetic is actually a topic whose basics you are familiar with. We start 
with a couple of division facts which you may have learned in secondary school. 


There is a nice way to see if a (positive) integer is|divisible by 3, For example, 
consider the integer 123456789. We may add up all the digits to get 


(6.1) 14+24+344454647+8+9=45. 
If the result (in this case the number 45) is divisible by 3, then the original number 
is divisible by 3. Since 45 is indeed divisible by 3, we conclude that 123456789 is 
divisible by 3. Note that we can even check that 45 is divisible by 3 by adding up 
its digits to get 9, which is divisible by 3. 

Moreover, if the result is divisible by 9, then the original number is divisible 


by 9. So, since 45 is actually divisible by 9, we obtain the stronger result that 
123456789 is divisible by 9. Indeed, we check that 


(6.2) 123456789 = 9- 13717421. 
The facts above about division by 3 and by 9 are some of the many consequences 


of the arithmetic of congruence. To give us a taste of this, we now give proofs of 
these facts. 


Observe that we can write a 2-digit number as 
(6.3) a1a9 = a110' + ao 10°, 
where 0 < ag, a, < 9 and a, > 0. For example, 45 = 4-10'+5-10°. In general, we 
can write an (n + 1)-digit number as 
(6.4) Gn@n—1°+ + 420149 = Gn10” + an_110"~1 +. --- + a210? + 110! + a9 10°, 
where 0 < ao, @1,42,.--,@n—1,4n <9 and a, > 0. 

Next, we observe that each of the numbers 
(6.5) 9, 99, 999, 9999, ... 
is divisible by 9 and in particular divisible by 3. In other words, 

10" — 1 is divisible by 9 

for every positive integer k. So, if we divide our integer in (6.4) by 9, we can obtain 
a remainder of 
(6.6) Qn +Gn—1 +--+ +aq+a1 + ao. 


Another way of saying this is that the integers a,d,,_\---a2a a9 and ay, + a,—) + 
-++-+a2+a,+a9 differ by a multiple of 9. For example, 345 = 3+4+5+3-99+4-9. 
Therefore, we have proved: 
Proposition 6.1. The (n + 1)-digit integer a,,a,,—|--+a2a1ao is divisible by 9 if 
and only if the integer a, +4,—, +:::+4d2+a, +49 formed by summing its digits 
is divisible by 9. 
Since 3 divides 9, we have that 10" — 1 is divisible by 3. So, as a bonus we get 


the additional fact that a,@,,_\---@2a dp is divisible by 3 if and only if the integer 
Gn + Oyn—-1 +++: + a2 +a, + ao is divisible by 3. 
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For example, 345 = 3+4+5+43-33-3+4-3-3. So 345 is divisible by 3 if and 
only if 3+4+5 = 12 is divisible by 3. Since 12 is divisible by 3, we conclude that 
345 is divisible by 3. Note that we may also conclude that 345 is not divisible by 9. 


In the arguments above, we actually used modular arithmetic, a.k.a. the arith- 


metic of congruence. After you feel that you have read enough of this chapter, we 
invite you to revisit the discussion above and prove the facts by modular arithmetic. 


6.2. Congruence modulo m 


Consider the days of a non-leap year, numbered from 1 to 365. If January 1 is a 
Tuesday, then we know that March 27 is a Wednesday. This is because March 27 is 
the 86th day of the year and 86 = 7-12+2, so March 27 is the same day of the week 
as the second day of the year, which is a Wednesday. Here, we doing arithmetic 
“modulo 7”. 

Recall the modulo 7 number wheel in Figure at the beginning of this 
chapter. If, for example, we look to the right of its center, then we see the numbers 


2,9, 16, 23, 30. 
Another way of saying this is that we see 
7k+2, where k=0,1,2,3,4. 
So each of these integers, after being divided by 7, has a remainder of 2. 


The integers in a wedge of a given color in Figure[6.0.1]have the same remainder 
modulo 7. When integers have the same remainder (modulo a fixed number such as 
7), we can think of the integers as being the same or equivalent from a perspective. 
In the next subsection we will make this precise. 


0 7 14 21 
<j ° 


2 o_o __e—__® 
123 4 5 6 9 12 15 18 


Figure 6.2.1. The first six positive multiples of 3 and their remainders after 
dividing by 7. The remainder of each multiple is congruent to the multiple, 
and hence they are colored the same. 


6.2.1. Definition of congruence modulo m. Now we dig into the mathematics 
of modular arithmetic. 


Let m be a positive integer. 


Definition 6.2. We say that integers a and b are congruent modulo m if and 


only if a — b is divisible by m; that is, a — b is a multiple of m. 


For example: 
1 
2 
3 
4 


The integers 55 and 55 itself are congruent modulo 3. 
The integers 277 and 2 are congruent modulo 5. 


(1) 
(2) 
(3) The integers —3, 17, 29, and 37 are all congruent to each other modulo 4. 
(4) 


The integers 2, 9, 16, 23, 30 are all congruent to each other modulo 7. 
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(5) In Figure[6.2.1] numbers of the same color are congruent to each other modulo 
7. E.g., 18 = 4 mod 7. 


Non-examples are: 
(1) The integers —2 and 2 are not congruent modulo 5. 
(2) No two of the integers 16, 25, 49, 122,253 are congruent modulo 5. 


(3) The integers 123456789 and 9876543210 are not congruent modulo m if m is 
even. 


(4) A fun fact is that if m is odd, then —a and a are not congruent modulo m if 
and only if a and 0 are not congruent modulo m. At some point soon during 
your reading of this chapter, you should be able to prove this. 


We often write the condition that a and b are congruent modulo m as 
(6.7) m|(a— Db) 


(i.e., m divides a — b) and we often use the fact that this means that there evists 
q € Z such that 


a—b=mgq. 
For example, 
(6.8) (—23) | 92, 34 | 204. 


One writes 
a=b modm 


for a and b being congruent modulo m. For example, 


277=2mod5 and —3=17 mod 4. 


Exercise 6.1. Show that two integers a and b are congruent modulo 2 if and only 
if either a and b are both even or a and b are both odd. 


Idea: The difference of two even integers is even. The difference of two odd 
integers is even. Etc. 


Exercise 6.2. (1) Prove by induction that for every non-negative integer k, 
(6.9) 10" = 1 mod 9. 

(2) Use part (1) to show that for every non-negative integer k, 
(6.10) 10" = 1 mod 3. 


6.2.2. Properties of congruence modulo m. The relation of congruence has 
the following general properties: 


(1) Any integer a is congruent to itself modulo m; that is, 
a=amodm. 


This is because a — a = 0 is divisible by m since 0 = m- 0. 
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Figure 6.2.2. Pairs of integers (a,b), where a = b mod 3. 


(2) If a is congruent to b modulo m, then b is congruent to a 


modulo ™; i.e., 
a=bmodm implies b=amodm. 
Indeed, m | (a — b) implies m| (b— a). 


(3) If a is congruent to b modulo m and if b is congruent to c 
modulo m, then a is congruent to c modulo m; i.e., 


a=bmodm and b=cmodm implies a=c modm. 
Indeed, m| (a — 6) and m| (6 — c) implies 
(6.11) m| (a—b)+(b—c));. Le, m|(@—<). 


Remark 6.3. A relation that satisfies the properties above of reflexivity, symmetry, 


and transitivity is called an equivalence relation| (see §8.4] below for a general 


discussion). 
If we think of being congruent to as being excellent to, the equivalence relation 
properties of congruence say the following: 
(1) (Self-love) An integer is always excellent to itself. 


(2) (Mutual love) If a is excellent to b, then b is excellent to a. 
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(3) (Transference of love) If a is excellent to b and if b is excellent to c, then a is 
excellent to c. 


So, if a isn’t excellent to b, then b isn’t excellent to a! 


Recall that the remainder function r : Z > Rj», is defined by as 
(6.12) r(a) =7, 
where 0 < r < m is the unique remainder given by the Division Theorem. Since 
a=maq(a)+r(a), we have that 
(6.13) a =r(a) mod m. 
That is, any integer is congruent to its remainder. 
The following says that any two distinct remainders are incongruent. 


Lemma 6.4. No two distinct elements of the set Rm = {0,1,2,...,m— 1} are 
congruent to each other modulo m. 


Proof. Let a,b € Rm. Then 0 < a,b < m. Suppose that a = bmodm. Then 
m|(b— a). That is, there exists an integer k such that 
(6.14) mk =b—a. 


It is easy to see that 0 < a,b < m implies that —m < b—a < mi (exercisel). 
Therefore 


(6.15) —m<mk<m, whichimplies -—1<k<1. 


Since k is an integer, we conclude that k = 0, which implies that a = b. 


Exercise 6.3. Let a,b,m be real numbers. Prove that if 0 < a,b < m, then 
—m<b-a<m. 


Hint: You may use the fact that ifx <y andz<w, thenz+z<ytuw. 


The following says that if two integers are congruent modulo an integer, then 
they are congruent modulo any positive divisor of that integer. 


Lemma 6.5. If x = y mod m and if b is a positive divisor of m, then x = y mod b. 
Exercise 6.4. Prove Lemma [6.5} 
Example 6.6. By Lemma[6.5} If « = y mod 140, then x = y mod 35. 


0 35 70 105 140 175 210 245 280 
<+ |e {+ ——+ 
146 286 


Figure 6.2.3. 286 = 146 mod 140 implies that 286 = 146 mod 35. Indeed, 
286 — 146=1-140=4.-35. 

Solved Problem 6.7. The prime factorization of 123456789 is 

(6.16) 123456789 = 32 - 3607 - 3803. 


Using this fact, find the smallest integer greater than 123460000 which is congruent 
to 1 modulo 3607. 
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Solution. Firstly, we observe that 123460000 is greater than 123456789 and 
that their difference 


(6.17) 123460000 — 123456789 = 3211 


is less than 3607. By (6.16), 123456789 = 3607q for some integer q (note that 
q = 37 - 3803, but we won’t use this). Thus, 


(6.18) 123456789 + 1 = 1 mod 3607; 


however, 123456789 + 1 < 123460000. Thus, the smallest integer greater than 
123460000 which is congruent to 1 modulo 3607 is 


(6.19) 123456789 + 3607 + 1 = 123460397. 
Exercise 6.5. Note that 7-11-13 = 1001. 


(1) List the smallest 5 integers greater than 1000 that are congruent to 1 modulo 
13. 


(2) List the largest 5 integers less than 1000 that are congruent to 12 modulo 7. 
Exercise 6.6. (1) Find the largest integer less than 456456 that is congruent to 
—1002 modulo 1001. 
(2) Find the largest multiple of 13 less than 567567. 


(3) Letr:Z— Rig be the remainder function. Find r(579583) by “calculating by 
hand”. 


6.2.3. Congruence properties of addition and multiplication. Congruence is 
compatible} with addition, subtraction, and multiplication in the following sense. 


Example 6.8 (Congruence and addition and multiplication). Since 46 = 3 mod 43 
and 51 = 8 mod 43, we have 


46+ 51=3+8=11 mod 43 
and 
46-51 =3-8 = 24 mod 43. 


The congruences in the example above are special cases of the following result. 
This result says that the sums, differences, and products of congruent integers are 
congruent. 


Theorem 6.9 (Congruence and addition, subtraction, and multiplication). Ifa, = 
a2 mod m and 6; = bg mod m, then 
(1) ay+ by =aqt+ bg mod Mm, 
(2) ay — by = ag — bg mod m, 
(3) a1b, = agbz mod m. 
Proof. (1) By hypothesis, there exist integers k and @ such that a; — ag = mk and 
by — bg = mk. Hence 
(ay + bi) = (a2 + be) = (ay = a2) + (by — bz) 
=mk+me 
=m(k+ 2). 
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Since k+£ is an integer, we conclude from the definition of congruence that a,+b; = 
a2 + bg mod m. 


(2) We leave this as an exercise. 


(3) By hypothesis, there exist integers k and @ such that a; — ag = mk and 
b, — bg = mé. Hence 
(a1b1) — (agb2) = ab) + (—agb1 + agb1) — agb2 
= (a,b, — a2b,) + (agby — agb2) 
= (a1 — az) by + (b1 — be) a2 
= mkb; + mlag 
m (kb, + az). 
Since kb; + ¢az is an integer, we conclude from the definition of congruence that 
a,b, = agb2 mod m. 


Exercise 6.7. Prove part (2) of Theorem [6.9] 


I 


By using induction, we can extend the sum property of congruence. 


Exercise 6.8 (Extending Theorem[6.9[1)). Let m and k be positive integers. Prove 
that if we have integers a; and b;, where 1 <i<k, satisfying 


(6.20) a;j=b;modm forl<i<k, 
then their sums satisfy 
(6.21) a, +agt-+:-+ay =), +b. +--+ +b, mod m. 


Revisiting the subject of multiples of 3 and 9 discussed at the beginning of this 
chapter, we have: 


Exercise 6.9. Prove that 

(6.22) AnGn—1°** G2Q1A9 = An +Gy_1 +-++ + ag +44 + ap mod 9. 

Explain why in the congruence above, one can replace “mod 9” by “mod 3”. 
Hint: 


(6.23) AnOn—1°** 030201 = S- azl0*—". 
k=1 


Exercise 6.10. Consider an integer a,d,,—\: +: a2a,a9, where the a;’s are the digits 
of this (n + 1)-digit integer. Suppose that 


(6.24) Oy HOger fee? en + ay ag = brpbp_1 8 b2b1bo mod 9, 


where the b,;’s are the digits of the (k + 1)-digit integer bybyp-1---b2bibo. Explain 
why 


(6.25) An Gy—1°+* 20,d9 = by + be_-1 +++: +b2 +b, + bo mod 9. 
Finally, at the risk of being too pedantic (if you liked the proofs above, no need 


to read further!), here are slightly different proofs of parts (1) and (3) of Theorem 
{6.9} By hypothesis, there exist k,@€ Z such that 


a, =a, +km, by = bo + bm. 
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Then mod m, 


a, +b; = (a2 + km) + (b2 + Lm) 
Hagthbo+(k+l)ym 
= a2 7 ba, 
and also mod m, 
ab; = (a2 + km) (bo + lm) 


agb + (age + kb + kém) m 


= Agb. 


6.2.4. Congruence multiplication tables. 


Question: Why did the mathematician do multiplication problems on 
the floor? 
Answer: Because they were taught not to use tables. 


We can define multiplication on the set of remainders R,, = {0,1,2,...,m—1} 
as follows. (The set Ry, is also called the {least residue system modulo m|) Define 
(6.26) Xm: Rm xX Rm > Rm 
by 
(6.27) @ Xm b:= Xm(a,b) := r(ab), 


where r: Z — R,, is the remainder function. 


Example 6.10. Modulo 5, we have 2 x5 3 = 1, and the multiplication table for 
(Rs, X5) is given by 


x5/O0 1 2 3 4 
0/0 00 0 0 
1/0 12 3 4 
2|}0 241 3 
3/0 3 1 4 2 
4;0 43 2 1 


Exercise 6.11. Give the multiplication table for (R7, x7). 


Solved Problem 6.11. Find all 1 <a < 6 for which the function fa : Re — Re 
defined by (5.112) is a bijection. 


Solution. Recall that f, : Re + Re defined by 
a(x) = r(aa), 


where r: Z > Rg is the remainder function. Using this, we express the values of 
the functions fa, 1 < a < 6, in the table in Figure |6.2.4) From this table we see 
that the functions f; and fs are both bijections. On the other hand, we also see 
that each of the functions fo, f3, f4 is neither an injection nor a surjection. Observe 
that ged(1, 6) = gcd(5,6) = 1, whereas gcd(2, 6), gcd(3, 6), gcd(4, 6) > 1. 
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z | filx)| fol) | fa(x)| fa(z) | fo(z) 


Figure 6.2.4. The green shaded functions are bijections, while the red shaded 
functions are neither injections nor surjections. 


6.2.5. Congruence properties of perfect squares, products, and powers. Let 
us first consider perfect squares modulo 4. Let n be a perfect square. Then n = a?, 
where a is a non-negative integer. 


Case 1: a is even. Then there exists an integer k such that a = 2k. We 
calculate that 


(6.28) n—o = (2k)? =—4-k?. 


Hence n = 0 mod 4. 


Case 2: a is odd. Then there exists an integer / such that a = 2k+1. We 
calculate that 


(6.29) n=a7 = (2k+1)? =4-(k? +k) 41. 


Hence n = 1 mod 4. 


We conclude: 


Lemma 6.12. If n is a perfect square, then n = 0 or 1 mod 4. 


For example, we can immediately deduce that 10!°! +2 and 10!°! + 3 are not 
perfect squares since they are congruent to 2 and 3 mod 4, respectively. 


Now we make a few remarks about more general moduli m. Recall from (6.13) 
that a = r(a) mod m. By Theorem [6.9(3), we have 


(6.30) a” =r(a)? mod m. 


Since r(a) is contained in the remainder set R,,, for every integer a, we have that 
any perfect square is congruent modulo m to r? for some remainder r. 
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For example, taking m = 3, we see that modulo 3 we have 0? = 0, 17 = 1, and 
2? = 1. Hence: 


Lemma 6.13. For any integer a, a? = 0 or 1 mod 3. 


Exercise 6.12. Can an integer of the form 3k + 2, where k © Z, be a perfect 
square? 


Exercise 6.13. Let n be a perfect square. Prove the following: 


1) n=0, 1, or 4 mod 5. 


( 

(2) n= 0, 1, 3, or 4 mod 6. 
(3) n= 0, 1, 2, or 4 mod 7. 
(4) n=0, 1, or 4 mod 8. 


By using induction, we can extend the product property of congruence. 


Lemma 6.14 (Congruence and multiple multiplications!). Let k be a positive in- 
teger. If we have integers satisfying 


(6.31) a;j=b;modm forl<i<k, 
then their products satisfy 

(6.32) a1 42°++ Ap = bi b2+++bp mod m. 
In particular, if a= b mod m, then 

(6.33) a® = b* mod m. 
Proof. We prove this by induction. 


The base case (k = 1) is tautologically true. 
Now suppose that k is a positive integer with the property that for all sets of 


integers a,,...,@, and b;,...,b, satisfying a; = b; mod m for 1 <i< k, we have 
(6.34) €10°+: Ap = b1b2---by mod m. 
Now suppose that a1,...,@%41 and bi,...,b,41 are integers satisfying a; = b; mod 


m for 1 <i<k-+1. Then, by (6.34) and az41 = 6,41 mod m, we have 


(6.35) @102°** Opi = (41G2°++ GR) + Opa 
= (b, bo <8 br) . be41 
= by be aa byt mod m. 


By induction, the theorem has been proven! 


Exercise 6.14. Using Lemma [6.14 give a shorter proof that 10 = 1 mod 9 for 
every positive integer k. 


Exercise 6.15. Using Lemma [6.14] give another proof that the product of any 
finite number of odd integers is odd. 


Exercise 6.16. Prove that for every positive integer k, 1000000" = 1 mod 13. 
Hint: 1000000 = 10007. 
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6.3. Inverses, coprimeness, and congruence 


In the set of integers, only 1 and —1 have multiplicative inverses. On the other 
hand, in the set of rational numbers, any non-zero number has a multiplicative 
inverse. We now consider inverses modulo m. 


Definition 6.15. Let m be a positive integer. Let a be an integer. We say that 
an integer b is a (multiplicative) inverse] of a modulo m if 


(6.36) ab = 1 mod m. 

Theorem 6.16. Let m be a positive integer, and let a be an integer. Then: 
(6.37) a has an inverse modulom <= gcd(a,m) = 1. 

Proof. (<=) Suppose that a and m are coprime. By Theorem [4.20] there exist 
integers b and n such that 

(6.38) ab+mn=1. 


This implies that b is an inverse of a modulo m. 


(=) Suppose that an integer a has an inverse b modulo m. Then ab = 1 mod m. 
This implies that there exists an integer q such that 


(6.39) ab —1= qm. 
By Theorem [4.20] this implies that a and m are coprime. 


In particular, if m is a prime, then a has an inverse modulo m if and only if a 
is not a multiple of m. 


Can you find the inverses modulo 7 of each of the integers 1, 2,3,4,5,6 by just 
looking at the colorful multiplication table in Figure [8.1.4] in Chapetr [B? Check 
your answers by calculating remainders of products! 


Solved Problem 6.17. Find all pairs of inverses modulo 30; namely, find all 
(x,y) with « < y satisfying xy = 1 mod 30. 


Solution. By straightforward calculations, one sees that the pairs of all in- 
verses are 


(6.40) (1,1), (7,13), (11,11), (17, 23), (19,19), (29, 29). 
The integers between 1 and 29 without inverses are those not coprime with 30, 
which are 
(6.41) 2, 3, 4, 5, 6, 8, 9, 10, 12, 14, 15, 16, 18, 20, 21, 22, 24, 25, 26, 27, 28. 
Observe that for each integer a in this list, there exists an integer 0 < b < 30 
satisfying ab = 0 mod 30. 

Inverses, if they exist, are unique with respect to any modulus. 
Exercise 6.17. Suppose that a has an inverse b modulo m. Prove that any inverse 
of a is congruent to b modulo m. Hint: Use Theorem [4.23] 


Thus, if0<a<m and gcd(a,m) = 1, then a has a unique inverse b modulo 
m satisfyingO<b<m. 
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Solving linear congruence equations of a single variable is generally easy. For 
now, we just content ourselves with a few small modulus examples. 


Exercise 6.18. For integers 2 << a< m < 8 satisfying gcd(a,m) = 1, find the 
unique integer 0 <a <m satisfying 


(6.42) az = 1modm. 
Exercise 6.19. Find the smallest 5 positive integers x satisfying the equation 
(6.43) 2x = 1 mod 3. 


Do you see a pattern? 


In view of how we take negatives modulo m, we have: 


Lemma 6.18. Let m be a positive integer, and let0 <a,b<m. Ifab=1modm, 
then 


(6.44) (m — a)(m — b) = 1 mod m. 
Exercise 6.20. Prove Lemma |6.18 
As an application of Lemma [6.18] consider for example the inverse pairs in 


Solved Problem [6.17] Since (7,13) is an inverse pair, we can deduce that 
(30 — 7,30 — 13) is an inverse pair; that is, (17,23) is an inverse pair. 


6.4. Congruence and multiplicative cancellation 


In this section we discuss a way to simplify congruence equations. We suggest that 
the readers practice with lots of examples until they get the hang of it. 

Here is the general result about dividing congruence equations. In other words, 
this is the multiplicative cancellation property. This is a very useful property for 
simplifying congruence equations. 


Theorem 6.19 (Congruence and division). Let m € Z* anda € Z and let g = 
gcd (a,m). We have 


(6.45) aby =abgmodm © 6; = bg mod ¢ 
Proof. Suppose that ab; = abz mod m. This is equivalent to m dividing a (b; — ba). 


By Theorem [4.32] this in turn is equivalent to % dividing 6; — 62, which finally is 
equivalent to b; = b2 mod 


As special cases of the multiplicative cancellation property, we have the follow- 
ing two corollaries. They represent the extreme cases for gcd (a, m). 


Corollary 6.20 (Congruence and division by a divisor of the modulus). Suppose 
that a is an integer which divides m (i.e., m/a is an integer). Then 


aby =abp mod m © by =b2 mod m/a. 


Proof. This is the theorem in the special case where a divides m, which implies 
that ged (a,m) = a. 
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Corollary 6.21 (Congruence and coprime division). Suppose that a is an integer 
such that a and m are coprime (i.e., gcd(a,m) = 1). Then 


ab} =abo2 mod m & 6b; =be mod m. 


Proof. This is the theorem in the special case where gcd (a,m) = 1. 


Example 6.22. We have 
(1) By Corollary {6.20} 
6b; = 6b. mod 18 S = 6b, =be mod 3. 
(2) By Corollary [6.27] 
35d; = 35b2 mod 18 = 6; =be mod 18. 


(3) By Theorem 


156; = 15b2 mod 18 & 6; =be mod 6. 
Observe that these results follow from: 


(1) 6b = 0 mod 18 if and only if b=0 mod 3. That is, 6 = 18k for some ¢ € Z 
if and only if b = 3¢ for some k € Z. 


(2) 35b = 0 mod 18 if and only if b= 0 mod 18. That is, 35b = 18k for some 
k € Zif and only if b = 18¢ for some ¢ € Z. 


(3) 15) = 0 mod 18 if and only if b = 0 mod 6. That is, 15b = 18k for some 
k € Zif and only if b = 62 for some ¢ € Z. 


Exercise 6.21. For each of the following congruences, write an equivalent congru- 
ence of the form b; = bz mod m: 


(2) 9b; = 9b2 mod 21. 
(3) 28b, = 28b2 mod 45. 
We take this opportunity to observe the following. 
Lemma 6.23. Let m be a positive integer. Ifa = bmodm, then 


(6.46) gcd(a,m) = ged(b, m). 


Proof. Suppose that a = b mod m. Then there exists an integer q such that 
(6.47) a=mqtb. 
By Theorem [4.16] we conclude that gcd(a,m) = ged(b, m). 


Exercise 6.22. Let p be a prime, and let a,b € Z. Prove that ab = 0 mod p if and 
only if a = 0modp orb=O0modp. Hint: Rephrase the conditions in terms of 
division, and use properties of primes and division. 
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6.5*. Fun congruence facts 


6.5.1. Squares of primes. Squares of primes have the following interesting 
congruence property. 


Theorem 6.24. If p > 3 is a prime, then p? = 1 mod 24. 


Proof. Since p is odd, both p—1 and p+ 1 are even. Hence, by Exercise 
4 divides one of them (while the other of course is divisible by 2). Thus 8 divides 
(p — 1)p(p +1) = p(p? — 1). Since we also have that 3 divides (p — 1)p(p+ 1) and 
since 3 and 8 are coprime, we have that 24 divides p(p? — 1). On the other hand, 
since p > 3 is a prime, we have that 24 and p are coprime. Therefore 24 divides 
pe —1. 


Of course, the conclusion of the theorem is not true if p = 2 or p= 3. 


Example 6.25 (Squares of primes p < 20). The squares of the primes between 5 
and 19, inclusive, are 25 = 2441, 49 = 24-241, 121 = 24-541, 169 = 24-741, 
289 = 24-1241, 361 = 24-15 +1. 


Example 6.26. Consider primes that are congruent to 3 modulo 4. The list of the 
first 12 primes pj, po,...,pi2 satisfying p; = 3 mod 4 is: 


(6.48) 3, 7, 11, 19, 23, 31, 48, 47, 59, 67, 71, 79, 83. 
Observe that for each 1 < k < 7, the prime factorization of 2p; --- pz +1 generates 
a prime q which is congruent to 3 modulo 4 but is different than all of p1,..., Dx. 


For example, when k = 5, the new such prime is 271. See Table|6.5.1] This example 
illustrates the key idea of the proof of Theorem [6.27] below. 


Table 6.5.1. The new primes congruent to 3 modulo 4 are colored in red. 
Observe that regarding the prime factorization of 2p; pop3pap5 + 1, although 
5 = 1 mod 4 and 149 = 1 mod 4, we have 271 = 3 mod 4. 


k Dixess9Dk 2p1+--pr+1) prime factorization 
1 3 7 ¢ 

2 3,7 43 43 

3 3,7, 11 463 463 

4 3,7, 11,19 8779 8779 

5 3, 7, 11,19, 23 201895 5-149 - 271 

6 | 3,7,11,19, 23,31 6258715 5 - 1251743 

7 | 3,7, 11,19, 23, 31,43 | 269124703 6151 - 43753 


6.5.2. Infinitude of primes congruent to 3 mod 4. A result stronger than The- 
orem [4.12] on the infinitude of primes is the following. 


Theorem 6.27 (Bountiful primes congruent to 3 mod 4). There are an infinite 
number of primes p satisfying p = 3 mod 4. 


Proof. Suppose that there are only a finite number of primes p satisfying p = 
3 mod 4. Let 


(6.49) {pP1,P2,--+,Dk} 
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be the complete list of such primes. Note that since p; = 3 mod 4, we have that p; 
is odd for each 1 <i< k. Define 


(6.50) q := 2pipo-+:p,t+1. 
Since p,p2---pyz is odd, we have that 2p, p2---p, = 2 mod 4, and hence 
(6.51) qg=3 mod 4. 


Of course, the integer q is odd. 

Note that for each 1 < i < k, p; divides 2p;p2--- pz, and hence p; does not 
divide q. 

Since q > p; for all 1 <i<k, we have that q is not a prime. Let 
(6.52) q= G48 +g" 
be the prime factorization of g. Since q is odd, so is each qj, 1 <j < s. Thus, for 
each j, qj is either congruent to 1 or 3 modulo 4. 

Since each p; does not divide q and since the p; are all the primes congruent 
to 3 mod 4, we must have that each q; is congruent to 1 modulo 4. However, this 
then implies that g is congruent to 1 modulo 4, which is a contradiction to (6.51). 

Since the sole assumption we made to obtain the contradiction is that there 


are only a finite number of primes p satisfying p = 3 mod 4, we conclude that there 
must exist an infinite number of primes p satisfying p = 3 mod 4. 


The theorem begs the question: Are there an infinite number of primes p 
satisfying p = 1 mod 4? With the help of Fermat’s Little Theorem [6.63] below, we 
can answer this in the affirmative! See Exercise [6.38] below. 


6.6. Solving linear congruence equations 


Having discussed the basic properties of congruence, we may now solve linear con- 
gruence equations. 


6.6.1. How to solve linear congruences via examples. A linear congruence 
equation of a single variable is an equation of the form 
(6.53) ax = bmodm. 
Here, we wish to find all integers x which satisfy this linear congruence equation. 


Remark 6.28. If a = 1 (or more generally if a = 1 mod m), then the answer is 
easy: The solutions are the integers satisfying x = b mod m, which are the integers 


(6.54) b,b+m,b—m,b+ 2m, b— 2m,b+ 3m,b—3m,.... 
Example 6.29. Consider the linear congruence equation 
6x2 = 15 mod 21. 


By inspection, we see that x = 6 is a solution, because 6-6 = 36 is congruent 
to 15 modulo 21. But what are the other solutions? We may then observe that 
x =6+7 = 13 is also a solution. This is because then 62 = 6(6+ 7) = 36+ 42 and 
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as we just saw, 36 is congruent to 15 modulo 21, whereas 42 is a multiple of 21. We 
may guess that the general solution to our example linear congruence equation is 


c=6+7k, kEZ. 


This is indeed the case. We now discuss how to solve this and other congruence 
equations. 


To be more concrete, we approach solving linear congruence equations by work- 
ing some examples. Recall from Theorem|6.19]the following: Let m € Zt anda é Z 
and let g = gcd(a,m). Then 


(6.55) abj =aby mod m © 0b, =by mod —~. 
g 


Example 6.30. We use this result to see how we can simplify linear congruence 
equations. Consider 


4a¢ = 12 mod 14. 
By (6.55), we have that 


4x =12 mod 14 sS 47%=4-3 mod 14 8S «=3 mod 7 
14 


gcd(4, 14) 
congruence equation if and only if s =3+7k, where k € Z. 


since gcd(4,14) = 2 and = 7. That is, x is a solution to the linear 


Recall from Corollary[6.20} Suppose a divides m (so that ged (a,m) = a). Then 
aby =abop mod m S&S bh = be mod il 
a 


Example 6.31. Let us look again at the first example of this section: 


6z2=15 mod 21 Ss 3-2%=3-5 mod 21 @ 2x=5 mod 7 


since 2 = 7. At this point, we still haven’t derived the solution to this linear 


congruence equation. We’ll do that later. 


Recall from Corollary Suppose that a and m are coprime; i.e., gcd (a, m) = 
1. Then 


aby =abp mod m & 6, =be mod m. 
Example 6.32. Consider the linear congruence equation 
22=12 mod 7 & 22=2-6 mod 7 S&S 2«£=6 mod 7 


since gcd(2,7) = 1. This solves the linear congruence equation; i.e., the solutions 
are the integers of the form « = 6+ 7k, k € Z. 


The next result helps us when we are stuck in trying to solve a linear congruence 
equation. 


Lemma 6.33. ax = b mod m is equivalent to ax = b+cm mod m for every integer 
C. 
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Example 6.34. 
2x=5 mod 7 & 2x¢%=12 mod 7 


since 12 = 5+ 7. The motivation for considering the equivalence above is that 5 is 
not divisible by 2, but the integer 12 congruent to it (mod 7) is. 


Can you solve 2% = 5 mod 7 by looking at the colorful multiplication table in 
Figure [8.1.4] in Chapter [8? Which row or column of the multiplication table did 
you look at? 


Putting things together, we solve the first example of this subsection. 
Example 6.35. Solve 
6a = 15 mod 21. 
All of the following steps we have actually carried out above! 
Step 1. This linear congruence equation is equivalent to 
2x = 5 mod 7. 


Step 2. The equation 27 = 5 mod 7 is equivalent to 
2x = 12 mod 7. 


Step 3. The equation 2x7 = 12 mod 7 is equivalent to 
xz =6 mod 7. 


The key is that the coefficient in front of x is 1 so we can just read off the answer 
as: The general solution to the linear congruence equation is x = 6+ 7k, where 
k € Z. This proves the claimed solution to Example [6.29] 


We can also choose to describe the solutions using the “original” modulus 21 
instead of the “new” modulus 7. We see that x is a solution to the linear congruence 
equation if and only if 

x = 6, 18, or 20 mod 21. 


That is, we add 7’s to 6 until we get back to an integer congruent to 6 modulo 21. 
Observe that if we add 7 to 20, then we get 27, which is congruent to 6 modulo 21. 


Observe also that the number of solutions modulo 21 to the linear congruence 
equation is equal to 3, which is the gcd of 6 and 15 coming from 62 = 15 mod 21. 
We shall see later that this true in general. 


Exercise 6.23. For each of a = 1,2,3,4,5,6, find the unique 0 <a <7 satisfying 
(6.56) 5x2 = amod 7. 

6.6.2. How to solve a general linear congruence. Let a,b € Z be non-zero 
integers. Consider the general linear congruence equation for x € Z: 
(6.57) ax =b mod m. 


Remark 6.36. Equation (6.57) is indeed the general form of a linear congruence 
equation. Consider the apparently more general linear congruence equation 
(6.58) a,x + a9 =b mod m. 


All we have to do is move the ag to the other side to get an equation of the form 


(6.57)! 
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Lemma 6.37 (Criterion for non-existence). There does not exist a solution x € Z 
of the linear congruence equation ax = b mod m if and only if g = gcd (a,m) does 
not divide b. 
Equivalently: 
Lemma 6.38 (Criterion for existence). There exists a solution x € Z of the linear 
congruence equation ax = b mod m if and only if g = gcd(a,m) divides b. 
Proof. We have the following equivalences of statements: 
There exists x € Z, ax = bmod m 
<> there exists « € Z such that there exists y € Z, av — b= —ym 
= there exists z,y € Z, ax + ym = b. 


The last equation is a linear Diophantine equation and we have already proved that 
it has a solution if and only if g = gcd(a,m) divides b. 


Example 6.39 (Non-existence). The linear congruence equation 
21xz = 5 mod 33 

does not have a solution since gcd(21, 33) = 3 does not divide 5. 

Example 6.40 (Existence). The linear congruence equation 
21x = 6 mod 33 

has a solution since gcd(21, 33) = 3 divides 6. 

A particular solution is « = 5 since 
21-5=105 = 33-3+6. 


(This particular solution was found “by inspection”. A more systematic way would 
be to solve the linear Diophantine equation 212 +33y = 6 as we have done in earlier 
chapters.) 


Since 21-5 = 6 mod 33, z is a solution to 

21x = 6 mod 33 

21 (a — 5) = 0 mod 33 
33 divides 21 (a — 5) 
11 divides 7 (x — 5) 

11 divides x —5, 


tty 


where the last equivalence is true since gcd(7, 11) = 1. Soa complete set of solutions 
x modulo 33 is = 5,16,27. Again, we see that the number of solutions modulo 
the “original modulus” 33 is equal to 3, which is the gcd of 21 and 33. 


Exercise 6.24. List all the integers 0 < b < 15 such that the linear congruence 
equation 


(6.59) 6a = b mod 15 
has solution(s). 


For each such integer b, find all integer solutions 0 <a < 15. 
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By arguing more generally along the lines of Example we obtain: 


Theorem 6.41 (General solution from a particular solution). Let m= = where 
g = gcd(a,m). If xq is a solution to : 

(6.60) ax = bmod™m, 

then a complete set of solutions x modulo m is given by 

(6.61) Xo, 2o +M, Lo +2mM,...,%09 +(g —1)m. 

In particular, there are g solutions modulo m. 

Example 6.42. Consider the linear congruence equation 

(6.62) 6x = 3 mod 15. 

By inspection we see that zo = 3 is a solution. We have that gcd(6, 15) = 3. Thus 


m = 2 =5. So, by Theorem [6-41] a complete set of solutions 2 modulo m is given 
by 


(6.63) 3, 8, 13. 


Observe that this result agrees with Corollary [6.20] telling us that the congruence 
equation (6.62) is equivalent to 


(6.64) 2x =1 mod 5. 


We have the following important consequence of Lemma [6.38] 


Corollary 6.43. Let p be a prime. There exists a solution x € Z to the linear 
congruence equation 


(6.65) ax =1modp 
if and only if a is not a multiple of p. 


Proof. The (<) direction is easy to see by considering its contrapositive. 


The (=) direction follows from Lemma and the fact that if a is not a 
multiple of p, then gcd(a, p) = 1 (see Proposition [2.17). 


Again, let p be a prime. Suppose that a is not a multiple of p. Observe that if 
integers x and y satisfy (exercise!) 


(6.66) ax =1 mod p, ay = 1 mod p, 
then 
(6.67) x = y mod p. 


So the solution x in Corollary [6.43] is unique. A solution x is called an inverse of 
a modulo p. 

An interesting question is: When is an integer its own inverse modulo a prime 
p? 
Lemma 6.44. Let p be a prime. An integer a is its own inverse modulo p if and 
only if a= 1 mod p ora =—1 mod p. 
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Proof. Suppose that a € Z satisfies 


(6.68) a? =1mod p. 
Then 
(6.69) (a—1)\(a+1) =a? —1=0modp. 


So, by Exercise [6.22] we have that a — 1 = 0 mod p or a+ 1 = 0 mod p. 


Example 6.45 (Pairs of inverses modulo a prime). (1) For p = 7, modulo 7 the 
pairs of inverses (x,y) with ry = 1 and x < y are 


(6.70) (1,1), (2,4), (8,5), (6,6). 
(2) For p= 11, modulo 11 the pairs of inverses are 
(6.71) (1,1), (2,6), (8,4), (5,9), (7,8), (10, 10). 
Solved Problem 6.46. Find all pairs of inverses (x,y) modulo the prime 23 with 


zy =1 mod 23 and z < y. 


Solution. By inspection, where we provide the relevant calculations below, we 
see that these pairs are 


(1,1), (2,12), (3,8), (4,6), (5,14), (7,10), 
(9,18), (11,21), (13,16), (15,20), (17,19), (22, 22). 


For the pairs of inverses in the first row, the calculations we made to obtain these 
answers are 


2-12=24=23+1, 
3:8=24= 2341, 
4-6=24=23+1, 
5-14=70=3-234+1, 
7-10 =70 =3-234+1, 


and for the pairs in the second row, we applied Lemma [6.18] to the first row with 
the calculations 


23 — 1 = 22, 
23-2=21, 28-12=11, 
23-3=20, 23-8=15, 
23-4=19, 23-6=17, 
93—h=18, 23-14=9, 
23—-7=16, 23—-10=13. 


Exercise 6.25. Similarly to Example [6.45) write out the pairs of inverses (x,y) 
with cy =1mod p and « < y, for p= 13 and for p = 17. 
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Non-Example 6.47. If m is not a prime, then we have more possibilities for 
integers to be their own inverse modulo m. 


(1) Let m= 8. Then 1, 3, 5, and 7 are each their own inverse. 
(2) Letm=15. Then 1, 4, 11, and 14 are each their own inverse. 
(3) Let m = 20. Then 1, 9, 11, and 19 are each their own inverse. 


We end this subsection with the following generalization of Corollary [6.43] 


Theorem 6.48. Let m be a positive integer. There exists a solution « € Z to the 
linear congruence equation 


(6.72) ax =1modm 
if and only if a is coprime to m. 
Exercise 6.26. Prove Theorem [6.48] 
Example 6.49. For m = 15 the remainders that are coprime to 15 are 
1,2,4,7,8,11, 13,14. 
The set of all pairs of inverses (x,y) with zy = 1 mod 15 and a < y are 
(6.73) (1,1), (2,8), (4,4), (7,13), (11,11), (14, 14). 
Exercise 6.27. For each 1 <a< 17 find 1 <6 < 17 such that ab = 1 mod 18 if 
such an integer b exists. If you like, you may write your answer in a format similar 
to Example |6.49 
6.6.3. Solving congruences revisited and examples. Consider 
ax = bmodm 


and let g = gcd(a,m). If g does not divide b, then there are no solutions. So we 
may assume that g divides b. We then get that our linear congruence equation is 
equivalent to 


b 
oe = — mod il 
g g g 
where gcd @ m) = 1 as we know from a previous result. 


We will take this as saying that we only need to consider 
ax = bmodm, 
where gcd(a,m) = 1, which we now assume. 
6.6.3.1. Solving linear congruences by solving linear Diophantine equations. One 
way of proceeding from here is to solve the linear Diophantine equation 
az+my = 1, 
which we can by reversing the Euclidean algorithm since gcd(a,m) = 1. We then 
let « = b% and y = by. Then 
ax + my = b. 
Then z is a solution to 
ax = bmodm. 


We then add to x the multiples of 7 from 1 to g—1 to get all solutions modulo m. 
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Example 6.50. 


6x2 = 18 mod 14. 
We have that g = gcd(6, 14) = 2. So our equation is equivalent to 
3x = 9 mod 7, 


where we observe that gcd(3,7) = 1. By reversing the Euclidean algorithm| we 


obtain that 

3-(-2)+7-1=1. 
(In the notation above, Z = —2 and ¥ = 1. Thus x = 2- (—2) = —4 is a solution 
to our linear congruence equation. We then add to this solution + = 7 to get the 
only other solution modulo 14, which is —4+ 7 = 3. That is, our two solutions 
modulo 14 are 


a=-4 and x«=3. 


Another way we could have done this is simply to apply Theorem [6.19] to get 
that our linear congruence equation is equivalent to 
x=3mod7 


since gcd(6, 14) = 2. Thus, modulo 14, the solutions are x = 3 and x = 10 (note 
that 10 = —4 mod 14). 


6.6.3.2. Solving linear congruences by adding a good multiple of m. We recall 
the observation that the congruence equation az = b mod m is equivalent to there 
existing y such that ax — b = my; that is, ax = b+ my. In fact, for every y € Z, we 
have 


(6.74) ax=bmodm & ax=b+mymodm. 
So if we add the right multiple of m to 6, the result will be divisible by a. 


Example 6.51 (62 = 15 mod 21 revisited). Of course 15 is not divisible by 6. So 
we keep adding 21 to it until it is: 


15 +21 = 36, 


which is divisible by 6 (we were lucky on the first try). Thus our linear congruence 
equation is equivalent to 


6a = 36 mod 21, 
which on the other hand is equivalent to 
x =6 mod 7 


since gcd(6, 21) = 3. Thus we obtained the answer rather easily. 
Solved Problem 6.52. Solve the linear congruence equation 
(6.75) 522 = 24 mod 18. 


Solution. Equation (6.75) is equivalent to 
132 = 6 mod 9 
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since gcd(4, 18) = 2 and 18/2 = 9. The quickest way to solve this equation is to 
add multiples of 9 to 6 until we get a multiple of 13. In this vein we consider the 
equivalent equation 

13x =64+8-9= 78 mod 9. 
Since 78 = 13-6 and gcd(13, 9) = 1, we see that this equation is in turn equivalent 
to 

x =6 mod 9. 

Thus, a set of incongruent solutions modulo the original modulus 18 is {6,15}. The 
set of all integer solutions is {6 + 9k: k € Z}. 


6.7*. The Chinese Remainder Theorem 


Can we solve for an integer being congruent to various integers modulo various 
moduli? Equivalently, can we solve systems of linear congruence equations? For 
example, does there exist an integer x such that 


(6.76a) x = 8 mod 24, 
(6.76b) x = 45 mod 23? 


Yes. It turns out that the general solution to this system of two congruence equa- 
tions is given by 


(6.77) av = 344 mod 552. 


Observe that 552 = 24- 23. 

On the other hand, the system of equations 
(6.78a) x = 8 mod 24, 
(6.78b) x = 13 mod 45 


has no solutions! This is because the first equation implies that x = 2 mod 3, 
whereas the second equation implies that « = 1 mod 3. 


Now let’s analyze the first system (6.76) of congruence equations. Firstly, 
Zo = 344 is a solution, as well as is any x satisfying x = xp mod 552. Conversely, 
suppose that x is a solution to (6.76). Then x = xq mod 24 and x = 2p mod 23. 
Since 23 and 24 are coprime, this implies that x = x) mod 24-23 = 552. Thus, the 
solution 344 is the unique solution modulo 552. 


Secondly, how can one find the solution 344? The general solution to the first 
equation is x = 8+ 24k, where k € Z. We can plug this into the second equation 
to obtain 

8+ 24k = 45 mod 23; that is, 24k = 37 mod 23. 


Since 24 and 23 are coprime, this congruence equation has a solution. Note that 
24 = 1 mod 23, and so k = 37 is a solution. We conclude that « = 8+ 24-37 = 896 
is a solution to (6.76). Now 896 — 552 = 344, so that 344 is also a solution. 


The following result appeared in the treatise|Siinzi Suanjing| (The Mathematical 


Classic of Master Sun). This result gives us a sufficient condition for solutions to a 
system of linear congruence equations to exist, but it doesn’t tell us how to solve 
them! The reader interested in learning how to solve such systems of congruences 
may consult the “Chinese Remainder Theorem” Wikipedia link. 


6.7*. The Chinese Remainder Theorem 279 


Theorem 6.53 (Chinese Remainder). Let my ,12,...,mx be pairwise coprime in- 
tegers greater than 1. That is, for each i A j, m; > 1 and gcd(m;,m,;) =1. Then 


for any integers b,,b2,...,b%, the system of linear congruence equations 
(6.79a) x= b; mod m, 

(6.79b) x= bo mod mz, 

(6.79c) i 

(6.79d) xv = by mod mz 


has a solution xo € Z. 
Moreover, if Xo is a solution, then x € Z is a solution to (6.79) if and only if 


(6.80) x= 2 mod mymy--+ Mp. 

In other words, (6.79) has a unique solution modulo mymz---mMr. 

Proof. (1) Uniqueness of the solution. Suppose that x and zo are integer solutions 
to (6.79). Then we have 


x — 29 =O0Omod m1, 


x — ty =O mod mo, 


L — Lp = 0 mod mz. 


Thus, each of m1,™mz2,..., mx divides x — xo. Since the m,’s are pairwise coprime 
by hypothesis, from Corollary we have that the product mimz2---m, divides 
x — Xo. We conclude that 


® = 2X0 mod mym2:-: Mz. 


Conversely, it is easy to see that if 2 is a solution to (6.79) and if x satisfies (6.80), 
then z is a solution to (6.79). This completes the uniqueness proof. 


(2) Existence of a solution. Recall that ry, :Z— Rm denotes the remainder 
function. Define the function 


(6.81) ft Rinimo--m, > Rm, X Rm, X+°+ X Rn, 
by 
(6.82) F(x) = (tm, (&), Pm2(Z) -- +, Pm, (2). 


Observe for the function f that both its domain Ry.m,...m, and its codomain 
Rm, X Rm. X +++ X Rm, have the same cardinality mjm2---m, (the latter by the 
multiplication principle for sets (Corollary [7.6] below)). 


By part (1) of the present proof, the function f is an injection. Indeed, the 
argument of part (1) says that if x,20 © Rmym--m, are such that f(x) = f(zo), 
that is, tm,(%) = m,(o) for all 1 <i <k, then rmymo-m,z(L) = Vmymo---my (LO)- 
Since 2,20 € Rmym2---m,, this implies x = 2, so that f is an injection. Thus, by 
Theorem 5.63] f is a surjection (and hence a bijection)! 
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Finally, given any integers b),bo,...,b,, since f is a surjection, there exists 
LE Rmiyms--m, Such that 
(6.83) f(x) = (tm, (1), Pms (b2) eS es (b;,))- 


This implies that r,,,(@) = rm,(b;) for all 1 <i <k, so that x = b; mod mj; ie., x 
is a solution to (6.79). This completes the existence proof. 


Exercise 6.28. Let p1,p2,...,Dr be distinct primes. Prove that for every integer 


by, bo,..., bx, the system of linear congruence equations 
(6.84a) x= b; mod py, 
(6.84b) xv = bz mod po, 
(6.84c) : 

(6.84d) x = by mod pr 


has a unique solution modulo p,p2--- pr. 


6.8. Quadratic residues 


This section is an introductory discussion of quadratic equations on the finite set 
Rm, which are important in number theory. We will return to this topic later in 
the book. 


6.8.1. Definition of quadratic residue and examples. Let m > 3 be an integer. 
Define the function f : Ry, > Rm by 


(6.85) f(z) =r(2’), 
where r: Z > Ry is the remainder function. In other words, we are considering 


the squaring function modulo m. 


Definition 6.54. Let m > 2 be an integer. We say that z € R,, is a quadratic 
residue if z is an element of the image of f, that is, if there exists x € Rm such 
that 2? = z mod m. 


Let 
(6.86) QRm = Im(f) — {0} 


denote the set of non-zero quadratic residues. 


In other words, a quadratic residue is a perfect square modulo m. 


Example 6.55. Our discussion of quadratic residues actually began in 44.1.7jon di- 
vision and squares. The results of Exercises [4.8] [4.10] and and Solved Problem 
[4.1] are summarized by Table [4.1.1] which says 


QRe = {1}, 
QR3 = {1}, 
QRa = {1}, 
QRs = {1,4}, 


QR, = {1,2,4}. 
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Exercise 6.29. Show that 


QRe = {1,3, 4}, 
QRs _ {1,4}, 
QRg = {1,4, 7}, 


QRio = {1,4,5, 6, 9}. 


Explain why this exercise, together with Example [6.55| show that for each 3 < 
m<10, f: Rm 7% Rm is neither injective nor surjective. 


Solved Problem 6.56. Find all positive integers x such that x? = 4 mod 6. 


Solution. Firstly, x? = 4 mod 6 if and only if there exists an integer g such 
that 2? = 6q¢ +4. Let x be a solution. Since 2? is even, we have that «x is 
even. This implies that x is congruent modulo 6 to 0, 2, or 4. If = 0 mod 6, 
then x? = Omod6. If = 2mod6, then 2? = 4mod6. If x = 4mod 6, then 
x? = 16=4 mod 6. Thus z is a positive integer satisfying x? = 4 mod 6 if and only 
if ¢ =6k+r, where k € Z2 and r = 2 or r = 4. Another way to characterize these 
integers is that they are the even integers that are not multiples of 6. 


Exercise 6.30. Find all positive integers x such that x? = 36 mod 5. 


6.8.2. Elementary properties of quadratic residues. We have the following 
property regarding products of quadratic residues. 


Exercise 6.31. Show that if x,y € QRm, then r(ay) € QRm provided xy is not a 
multiple of m. 
In particular, if x € QRm, then r(x?) € QR» provided x? is not a multiple of 


Mm. 


Example 6.57. For m = 9, we have 


(6.87) ra y=7, wPyH4.. raeT) =H 1. 


Quadratic residues have the following nice properties modulo primes p at least 
3. As a special case, you may consider p = 23 as in Solved Problem 


Solved Problem 6.58 (Quadratic residues modulo an odd prime). Let p be an 
odd prime. Define the mod p squaring function f : Rp — {0} > Rp — {0} by 
(6.88) f(a) =r(a’). 
Prove the following: 

(1) f ts well-defined. 


(2) (f is 2-to-1) For each r € Im(f), there exist exactly two elements of Rp — {0} 
whose images under f are equal to r. 


(3) Im(f) = f(R, — {0}) has cardinality (number of elements) equal to (p—1)/2. 


Solution. (1) To see that f is well-defined, we just need to show that for every 
a € R,—{0}, f(a) 4 0. Suppose, for a contradiction, that there exists a € R, — {0} 
such that r(a?) = f(a) = 0. Then p divides a?. Since p is a prime and by Corollary 
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this implies that p divides a. Since a € Rp, we conclude that a = 0, which is 
a contradiction to a € R, — {0}. 

(2) Suppose that a,b € R, — {0} satisfy f(a) = f(b). Then r(a”) = r(b?); that 
is, p divides a? — b? = (a— b)(a+b). 

Case 1: p divides a — b. Since a,b € Rp — {0} C Rp, this implies that a = b. 

Case 2: p divides a+b. Since a,b € Ry, — {0}, this implies that a+b = p; that 
is, b= p—a. Observe that since p is odd, we have a £ b. 

Conversely, if a = b or if a+b =p, then one easily checks that f(a) = f(b). 

Hence, we have proved that f(a) = f(b) if and only ifa = b ora+b=p. This 
proves that the function f : R, — {0} > R, — {0} is two-to-one; that is, for each 
r € Im(f), there exist exactly two elements of R, — {0} whose values under f equal 
r. 


(3) Consequently, by a counting argument, the cardinality of Im(f) is equal to 
one-half of the cardinality of R, — {0}; ie., | Im(f)| = (p — 1)/2. 


Example 6.59 (Perfect squares modulo 11). For m = 11, we have 

QR = ty 3, A, 5, ae 
which has cardinality 5. We have f(1) = f(10) = 1, f(2) = f(9) = 4, f(3) = 
£(8) =9, f(4) = f(7) =5, and f(5) = f(6) =3. 
Exercise 6.32. Suppose that x is a quadratic residue modulo ab, where a and b are 
positive integers. Letra: Z— Ry and ry: Z— Ry denote the remainder functions. 
Show that rq(x) is a quadratic residue modulo a and rp(x) is a quadratic residue 


modulo b. In other words, a quadratic residue modulo an integer is a quadratic 
residue modulo a divisor of that integer. 


Now suppose, for example, that m is a prime or a product of two distinct 
primes. Define, as before, 


f= QRm > QRn 
by f(x) = r(x?) for x € QRm.- 


Exercise 6.33. Using that m is a prime or a product of two distinct primes, show 
that f is well-defined. 


Exercise 6.34. Continuing with the previous exercise, investigate under the addi- 
tional restriction that 3 < m < 10 whether for each m the function f : QRm 7 
QR» is a bijection. 


6.8.3. Quadratic residues modulo a product of distinct primes. Consider, as 
above, the squaring function on the set of quadratic residues. 


Example 6.60. The function f : QR21 > QR21 defined by f(x) = r(x?) satisfies 
QRa = {1,4,7,9, 15, 16,18} and 


fl) =1, f(4) = 16, (7) = 7, f(9) = 18, f(15) = 15, f(16) = 4, f(18) = 9. 
Thus, f is a bijection. 
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Observe that 21 = 3-7. Now suppose more generally that m = pq, where p and 
q are distinct primes. Suppose that z,y € QRpq satisfy f(x) = f(y). By definition, 
we have that 


(6.89) x? = y? mod pq. 


Let us try to show that x = y; that is, show that f is injective. We will have to 
impose a condition on the primes p and gq to accomplish this (see Theorem [6.62] 
below, generalizing Example [6.60] which this is leading to). 


Suppose for a hypothetical contradiction that « # y. Since x,y € QRpq, this is 
equivalent to the assumption that 


(6.90) x #y mod pq. 
Exercise 6.35. Show that 
(6.91) xc#ymodp or x«#ymodq. 


By Exercise [6.35] without loss of generality, we may assume that 


(6.92) x #y mod p. 
On the other hand, by (6.89), 
(6.93) x? = y? mod p. 


Exercise 6.36. Show that y has an inverse z modulo p. Hint: Can you derive a 
contradiction from assuming that p divides y? 


Show that z € QRpq.- 

We have, for the inverse z of y modulo p from Exercise [6.36] that 
(6.94) (xz)? = 1 mod p. 
Exercise 6.37. (1) Explain why (6.94) implies that 


(6.95) “z=1modp or xz=-1modp. 


(2) Explain why we must have that xz = —1 mod p. Hint: Appeal to (6.92) and 
the uniqueness of the inverse modulo p. 


Now, since x and z are non-zero quadratic residues, we have that xz is a non- 
zero quadratic residue, i.e., 7z € QRpg, which implies that xz is a quadratic residue 
modulo p. Since xz = —1 mod p, we obtain that —1 is a non-zero quadratic residue. 


We now need the following result, which we do not prove. 


Proposition 6.61. If p is congruent to 3 modulo 4, then —1 is not a quadratic 
residue modulo p. 


By this result and the work above, we have proved the following. 


Theorem 6.62. Suppose that p and q are distinct primes that are both congruent 
to 3 modulo 4. Then the function f : QRpq + QRypq defined by f(x) = r(x?) is a 


bijection. 
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6.9. Fermat’s Little Theorem 


6.9.1. The statement of Fermat’s Little Theorem. We have considered ex- 
amples of the remainders of perfect squares after dividing by a positive integer m. 
One may ask is there are “special” exponents and moduli for which the remainders 
are nice. Here is a beautiful and fundamental result in this vein. 


Theorem 6.63 (Fermat’s Little Theorem). If p is a prime and if a is a positive 


integer which is not a multiple of p, then 
(6.96) a?! = 1 mod p. 
6.9.2. Examples of Fermat’s Little Theorem. Let us look at some examples. 
Clearly, for every prime p, 1?~! = 1 =1 mod p. 
(1) The prime p = 5: 
a) We have 24 = 16 =3-5+1=1 mod 5. 
b) We have 3+ = 81 = 16-5+1=1mod5. 
c) We have 44 = 256 = 51-5+1=1mod5. 


4 


he prime p = 7: 

(a) We have 2° = 64=9-7+1=1 mod 7. 

(b) We have 3° = 729 = 104-7+1=1 mod 7. 

(c) We have 4° = 4096 = 585-7+1=1 mod 7. 

(d) We have 5° = 15625 = 2232-7+1=1 mod 7. 

(e) We have 6° = (—1)®§ = 1 mod 7. 

6.9.3. The idea of the proof of Fermat’s Little Theorem. We consider the 


effect of multiplication of integers by a given integer a modulo p. At first glance, 
this topic has nothing to do with Fermat’s Little Theorem! 


Figure 6.9.1. (1607-1665). Wikimedia Commons, Public Domain. 
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For example, suppose that p = 5 and a = 2. Consider the set of non-zero 
remainders modulo 5, which are the integers {1,2,3,4} = Ny. Their multiples by 2 
are, in order, 


(6.97) {2, 4,6, 8}. 

Modulo 5, these are, in order, 

(6.98) {2,4,1, 3}. 

Observe that, as a set this is also equal to Ny. Likewise, multiplication by 3 yields 
(6.99) {3,6,9, 12} = {3,1,4, 2}, 

and multiplication by 4 yields 

(6.100) {4,8, 12,16} = {4,3, 2, 1}. 


In each of the cases above the set of multiples, modulo 5, is equal to the set Ny! 


More generally, suppose that p is a prime and a is an integer that is not a 
multiple of p. Consider each of the integers 1,2,3,...,p— 1 as representing both 
the number of a chair and the number of the person sitting on that chair. So we 
have p—1 chairs and p—1 people. Now change each person’s number by multiplying 
it by a and assigning the remainder to be the person’s new number. 


ie 


eose4 


Figure 6.9.2. Multiplication by 4 rearranges the six brothers in the 
six chairs. Tikz LaTeX code by Sigur answering the question “Draw- 
ing Super Mario Bros .... in LaTeX” in StackExchange. Licensed un- 
der Creative Commons Attribution-Share Alike 4.0 International License 
(https: //creativecommons.org/licenses/by-sa/4.0/deed.en). 


Figure [6.9.2] shows the anatine example of (but with no chairs 
removed!) with 6 Super Mario Bros.) brothers (Mario, Luigi, and their heretofore 
unknown four brothers Kevin, Jovani, Romeo, and Ignazio) and 6 chairs. Multi- 
plying each brother by 4 and taking the remainder modulo 7 assigns a new num- 
ber to each brother and hence a new chair (six chairs for six brothers instead of 


Seven Brides for Seven Brothers|). 
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We wish to prove this property in general. We start with a little help from our 
friends[] 


6.9.4. A proof of Fermat’s Little Theorem. The general musical chairs fact 
we use is the following. 


Lemma 6.64. Let m be a positive integer. If S is a (finite) set of integers such 
that no two integers in S are congruent to each other modulo m, then S has at most 
m elements. 

If, in addition, no integer in S' is congruent to 0, then S has at most m—1 
elements. In this case, if S has exactly m—1 elements, then S is congruent modulo 
m to Nm_1- 


Proof. Recall from (6.13) that for every integer a, it is congruent to its remainder: 
a=r(a) mod m. So, up to congruence, we can replace each element a of S by r(a), 
and we call the new set R. Since no two integers in S are congruent to each other 
modulo m, we have that no two integers in R are congruent to each other modulo 
m. Since RC Rm, by Lemma[6.4] this implies that the elements of R are distinct. 
By Lemma we conclude that 


(6.101) |S] =|R| <m. 

Next, if in addition no integer in S' is congruent to 0, then 0 ¢ R. This implies 
that 
(6.102) [S| =|R| <m-—1. 

Finally, if in addition S has exactly m—1 elements, then |R| = m—1. This and 


RC Rm — {0} = N»p_1 implies that R = N,,_1. We conclude that S is congruent 
modulo m to N»_1. 


A remarkable fact is that the modular arithmetic multiplication fact observed 
in the previous section for p = 5 and a = 2,3, 4 is true in general for every prime p 
and any positive integer a which is not a multiple of p. 


Lemma 6.65. Let p be a prime, and let a be a positive integer which is not a 
multiple of p. Then the set of multiples of a, 


(6.103) S := {a, 2a, 3a,...,(p— 1a}, 
is, up to congruence modulo p, the same as the set 
(6.104) Np-1 = {1, 2,3,...,p— 1}. 


Proof. We first show that no two elements of the set S in (6.103) are congruent 
to each other modulo p. Suppose that integers 1 < 7,7 <p—1 satisfy 


ia = ja mod p. 


Since a is not a multiple of p, it follows that gcd(a,p) = 1. Hence we can factor 
out the a in the congruence equation to get 


i=J7 mod p. 


1«With a Little Help From My Friends” is the title of a Beatles’ song. 
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By Lemma we conclude that 7 = j. Thus, any two integers in S cannot be 
congruent to each other. 
Moreover, if ia = 0 mod p, we obtain i = 0 mod p. Assuming i 4 0 mod p, we 
obtain ia 4 0 mod p. Hence, any integer in S cannot be congruent to 0 modulo p. 
Since S has p — 1 elements, by Lemma |6.64] we conclude that S is, up to 
congruence modulo p, the same as N,_1. 


Example 6.66. We have seen above with the six Super Mario Brothers that cor- 
responding to the prime p = 7 and the positive integer a = 4, the set of multiples 
of 4, 

S = {4,8, 12, 16, 20, 24}, 
is, up to congruence modulo 7, the same as the set 


Neo = {1, 2,3, 4, 5, 6}. 
We are now ready to prove: 


Theorem 6.67 (Fermat’s Little Theorem). If p is a prime and if a is a positive 
integer which is not a multiple of p, then 


(6.105) a?-! = 1 mod p. 
Proof. Since p is a prime and p does not divide a, by Lemma [6.65] we have that 
“modulo p” the set of integers 
S = {la,2a,...,(p—1)a} 
is “the same” as the set 
Np-1 = {1,2,...,p—1}. 
That is, each integer in S is congruent to exactly one integer in N,_1 (so different 


integers in S are congruent to different integers in N,_1). The KEY observation is 
that this implies that 


la-2a---(p—1)a=1-2---(p—1) mod p. 
That is, 
(p —1)!a?~' = (p—1)! mod p. 
Since p is a prime, by Corollary [2.18] we have 
(6.106) gcd(p, (p — 1)!) = 1. 
Hence, by Corollary [6.21] we conclude that 


a?-!=1 mod p. 


Example 6.68. Taking p = 7 and a = 4 as in Example [6.66] we have that 
6!-4®&=4.8-12-16-20-24=1-2-3-4-5-6=6! mod 7. 
Since gcd(6!,7) = 1, this implies that 
a° = 1 mod 7. 


We have the following characterization of primes using modular arithmetic and 
factorials. 
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Theorem 6.69 (Wilson’s Theorem). Let n > 2 be an integer. Then 


(6.107) (n—1)!=-lmodn © nis a prime. 


Proof. (1): = We prove the contrapositive of this implication. Suppose that 
n = ab, where a > 1 and b > 1, is a composite number. Suppose for a contradiction 
that (n — 1)! =—1modn. Then, by Lemma[6.5]we have that 


(6.108) (n — 1)! = —-1 mod a. 


But since 1 < a < n—1, we have that a divides (n—1)!, so that (n—1)! = 0 mod a. 
Since a > 2, we have a contradiction. We conclude that (n — 1)! 4 —1 mod n. 

(2): < Let n be a prime. If n = 2, it is easy to see that the implication is 
true. So we may suppose that n is an odd prime. Since n is a prime, by Corollary 
[6.43] each of the (even number of) integers 1,2,...,n — 1 has an inverse modulo n. 
Furthermore, by Lemma|6.44] the only integers of these which are their own inverse 
are 1 and n—1. Thus, the (even number of) integers 2,...,n—2 may be paired up 
with their inverses, where the product of each pair a,a~! satisfies a~'a = 1 mod n. 
We conclude that 


(6.109) 2-3---(n—2) =1modn, 
and hence 
(6.110) (n —1)!=1-(n—-1) =—-1 mod n. 


Example 6.70. We have for the prime p = 7 

(6.111) 6! = 720 = —1 mod 7 

since 721 = 7-103. Here, the pairs of inverses are (2,4) and (3,5). 
And also, for the prime p = 11, 

(6.112) 10! = 3628800 = —1 mod 11 


since 3628801 = 11 - 329891. Here, the pairs of inverses are (2,6), (3,4), (5,9), and 
(7,8). 

And another(!), for the prime p = 13, 
(6.113) 12! = 479001600 = —1 mod 13 


since 479001601 = 13 - 36846277. We leave it to you, dear reader, to list the pairs 
of inverses modulo the prime 13. 


Exercise 6.38. Let n > 2 be an integer. Observe that (n!)? + 1 is an odd integer. 
Let p be a prime divisor of (n!)? + 1. 
) Prove that p is odd and p> n. 
) Prove that (n!)?~! = (1) mod p. Hint: What is (n!)? mod p? 
3) Prove that (n!)?~! =1 mod p. Hint: Fermat’s Little Theorem [6.67] 
) Prove that p = 1 mod 4. 
) 


Explain how we can use the above to prove that there are an infinite number 
of primes p satisfying p= 1 mod 4, thus proving Theorem [6.27] 


6.10*. Euler’s totient function and Euler’s Theorem 289 


6.10*. Euler’s totient function and Euler’s Theorem 


Definition 6.71. For any integer n > 2, let y(n) be the number of positive integers 
k less than n satisfying gcd(k,n) = 1. The function 


(6.114) gp: Zt a Zt 
thus defined is called /Euler’s totient function} 
Observe that 1 < y(n) < n— 1. Define the set 
(6.115) Re:={ke€Z:1<k<n-—1 and k and n are coprime}. 


We have that y(n) is equal to the number of elements (cardinality) of the set R*. 
Note that R*, is a subset of the remainder set R, = {0,1,2,...,n— 1}. We may 
write 


(6.116) R* := {k © R, : gcd(k,n) = 1}. 
By Theorem [6.48] R* is the set of remainders that have inverses modulo n. 


Example 6.72. If p is a prime, then 


(6.117) y(p) =p-1. 
So 
(6.118) y(2) =1, ¥(3) =2, v(5) =4, (7) =6, (11) = 10, ete. 


Example 6.73. For the integer n = 15, the integers between 1 and 14 inclusive 
that are coprime with 15 are (see Example [6.49) 


(6.119) 1,2,4,7,8, 11, 13, 14. 


Hence, we count that y(15) = 8. 


Figure 6.10.1. Leonhard Euler (1707-1783). Wikimedia Commons, Public Domain. 
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Wikipedia gives a table of values of y(n) for 1 <n < 100. They are 


+1 42 43 44 +5 +6 +7 +8 49 +410 
0}; 1 1 2 2 4 2 6 4 6 4 
10; 10 4 12 6 8 8 16 6 18 8 
20) 12 10 22 8 20 12 18 12 28 8 
30] 30 16 20 16 24 12 36 18 24 = 16 

(6.120) 40| 40 12 42 20 24 22 46 16 42 = 20 
50] 32 24 52 18 40 24 36 28 58 16 
60] 60 30 36 32 48 20 66 32 44 24 
70 | 70 24 72 36 40 36 60 24 78 32 
80] 54 40 82 24 64 42 56 40 88 24 
90| 72 44 60 46 72 32 96 42 60 40 


We observe formula (6.117) for the primes, such as 
p(67) = 66. 


We will use right below the following result. 


Exercise 6.39. Let p be a prime and let k be a positive integer. Prove that ifm 
is a positive divisor of p*, then m is equal to one of the integers 1,p,p?,...,p*. 


In view of the Prime Factorization Theorem, we first consider Euler’s totient 
function for inputs that are powers of primes. 


Lemma 6.74 (Euler totient function of prime powers). If p is a prime and if k is 
a positive integer, then 


(6.121) y(p*) = p* (1 - ~) 


Pp 


Proof. Let m be any integer. Since gcd(m, p*) is a positive integer dividing p*, by 
Exercise [6.39] we have that gcd(m, p*) is equal to one of the following integers: 
(6.122) 1B, py 202, v”. 

In particular, it is easy to see from this that 

(6.123) gcd(m, p*) > 1 if and only if m is a multiple of p. 

Now, the multiples of p between 1 and p* inclusive are 

(6.124) 1p, 2p, 3p,..., p*—"p. 

We see that there are exactly p*~! such multiples (corresponding to 1,2,3,..., 
p*-1). That is, there are exactly p*—! integers m between 1 and p* inclusive 


satisfying gcd(m, p*) > 1. We conclude by the definition of Euler’s totient function 
that 


(6.125) v(p*) =p" —p** =p* (1 - ~) 
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Example 6.75. Consider 3° = 27. The multiples of 3 between 1 and 27 inclusive 
are 

3, 6,9, 12,15, 18, 21, 24, 27, 
which number nine in all. We have 


(27) = |{1, 2,4, 5, 7, 8, 10, 11, 13, 14, 16, 17, 19, 20, 22, 23, 25, 26}| 
1 
=18=27-. (1-5) = 27-9. 


Exercise 6.40. Find all (unordered) pairs of inverses modulo 27. You may refer 
to Example |6.75} 


Secondly, we consider Euler’s totient function of products of coprime integers. 


Theorem 6.76 (Euler totient function for coprime products). Ifm,n € Z* satisfy 
gcd(m, n) = 1, then 


(6.126) (mn) = g(m)(n). 


Proof. Recall that y(n) is the number of elements of the set R* defined by (6.115). 
Formula follows from showing that the two sets R*, x R* and R*,,, have the 
same number of elements. By Theorem |5.62) it suffices to find a bijection between 
these two sets. In turn, by Theorem it suffices to find a function between the 
two sets which has an inverse. 


Define the function f : R*, x Rt — Rx, as follows. Let (a,b) € R*, x Re. 


Since gcd(m,n) = 1, by the Chinese Remainder Theorem[6.53} there exists a unique 
integer ro € Rm» satisfying 


(6.127a) Lo =amodm, 
(6.127b) Xo =bmodn. 
We define 

(6.128) f(a,b) = 20. 


Since (a,b) € R*, x R*, we have gcd(a,m) = 1 and gcd(b,n) = 1. Thus, by (6.127) 
and Lemma |6.23}) we have 

(6.129) gcd(xo,m) = gced(a,m) = 1, gcd(xo,n) = gced(b,n) = 1. 

By Lemma this implies that gcd(a9, mn) = 1. Hence f(a,b) = xo is indeed in 
the specified codomain R*,,,, so that the function f is well-defined. 


Next, we define the function g: R*,,, > Rt, x R* by 
(6.130) o(#) = (tm (2), Pn(2)); 
where r,: Z—> Ry», andr, : Z— Ry, are the remainder functions. Let x € Ri,,,. 
Then gcd(x,mn) = 1, which implies that gcd(x,m) = 1 and gced(a,n) = 1. Hence, 
by Lemma [6.23] again, we have that ged(rm(x),m) = 1 and gced(r,(z),n) = 1, so 
that r(x) € R*, and r,(a#) € R*. Hence, the function g is well-defined. 

Now we check that f and g are inverses of each other. Let (a,b) € Rt, x Re. 
As above, we use the notation rp := f(a,b). Then, by (6.127), 


(6.131) xo =amodm, xo =bmodn. 
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Hence, 
(6.132) lm(Xo) = a, rn(Xo) = b. 
This proves that g(f(a,b)) = (a,b). 
Now let x € R*,,,. Then 
f(g(@)) = f@m(2), Pn(x)) = @ 


since 


L=Y,(x) mod m, x =r,(x) mod n. 


We have proved that f and g are inverses of each other. The theorem follows. 


For example, the theorem implies that 
(6.133) 915) = 9(3)p(5) = 2-4=8, (30) = (5) y(6) =4-2=8. 


Thirdly, and finally, we can extend Lemma and Theorem to the 
following beautiful general formula. 


Theorem 6.77 (Euler’s product formula). For any integer n > 2, 
1 
(6.134) y(n) =n (1 _ ~) , 
He; 


pin 


where the product on the right-hand side is over all distinct primes p dividing n. 


Proof. By the Prime Factorization Theorem, there exist primes p; < pg <--: < pp 
and positive integers k,,k2,...,k, such that 


(6.135) mp pepe, 


By Theorem [6.76] since gcd (pi, p;’) = 1 fori #4 J, we have 


(6.136) y(n) = (pt?) p(p5?) --- p(k"). 

Actually, we should be a bit more detailed in our justification: Firstly, since 
ecd(p}, ps? ---pkr) = 1, we have 

(6.137) y(n) = v(pt) y(n? --- pr"). 


Starting with this, we can prove (6.136) by induction on r. We leave this as an 
exercise. 


Now we are in a perfect position to apply Lemma [6.74] Indeed, by applying 
this lemma to (6.136), we obtain 


h 1 1 1 

ky ke k 
y(n) =p 1-2)» (1-=)--- pb (1-=). 
in) ( pj** P2 Pr 


The lemma now follows from and the fact that p,,p2,...,p, are the primes 
dividing n. 
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Example 6.78. Since the only prime divisors of 18 are 2 and 3, by Theorem 
we have 


1 1 1 2 
wl 18)=18(1-——= 1—=)=18-~=-=~=6. 
(6.138) (18) s( al 3) B55 =6 
Similarly, 36 = 273? satisfies 
1 2 
(6.139) (36) = 36- 5-5 = 12. 


Exercise 6.41. Use Theorem to compute Euler’s totient function y(n) for all 
composite numbers that are between 21 and 40 inclusive. You can also use Theorem 


[6.76] if you like. 


Recall that if m is prime, then y(m) = m— 1. A generalization of Fermat’s 
Little Theorem is|Euler’s Theorem (we will prove this theorem in §8.5.12|below ©), 


which says: 


Theorem 6.79. If positive integers a and m are coprime, then 


(6.140) a?(™) = 1 mod m. 


6.10.1. Reduced residue systems. 


Definition 6.80. A (finite) set of integers R is called a 


modulo m if the following are true: 

(1) For each r € R, ged(r,m) = 1. 

(2) The cardinality |R] = y(m). 

(3) Distinct elements of R are incongruent modulo m. 
Example 6.81. The set 
(6.141) R= {1,3,7, 9} 
is a reduced residue system modulo 10. We also have that 
(6.142) R' = {101{213) 747[909} 


is a reduced residue system modulo 10. 


Exercise 6.42. Let R= {r1,1r2,..-,1y(m)} be a reduced residue system modulo m. 
Prove that ifm > 2, then 

p(m) 
(6.143) S° rj; =0 mod m. 

i=1 


6.10.2. Schneider’s formula. Let a = 1+v5 be the golden ratio (as in (2.116)). 
Then a formula of Schneider’ is 


(6.144) a= s - (1 =): 


We do not prove this formula. © 
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6.11*. An application of Fermat’s Little Theorem: The RSA algorithm 


In this section we discuss an application of Fermat’s Little Theorem to cryptogra- 


phy, namely, the RSA algorithm, named after [Rivest and Our 


discussion is only a tiny glimpse of the application of number theory to cryptogra- 
phy. 

Let 
(6.145) A={A,B,C,...,Z}, B = {65, 66, 67,..., 90}. 
Define function f : A > 6 to be the unique bijection which takes the alphabetical 
ordering of A to the numerical ordering of B, so that f(A) = 65, f(B) = 66, etc 


For example, consider the word] 
BLADERUNNER, 


which is a string of 11 letters. Our friend Leon Kowalski would like to secretly send 
this word to us via the internet, in a way that anyone intercepting his message will 
not be able to decipher it. 


By using the function f, the word BLADERUNNER corresponds to a string of 
11 2-digit integers: 
(6.146) 66 76 65 68 69 82 85 78 78 69 82. 
This forms a 22-digit integer 
(6.147) a := 6676656869828578786982 


from which we can recover the word BLADERUNNER by applying the inverse 
function f~! : B — A to each of the 11 2-digit integers comprising a. So the 
question becomes: How do we enable our friend Leon to encrypt the integer a so 
that only we can decrypt it? 


Let p and q be large and distinct primes. For example, you could take (but not 
advisable!) p and q to be the twin primes in (1.90). Let 


m = pq. 
By Theorem [6.76] 
(6.148) y(m) = (p—1)(q- 1). 
On the other hand, without knowing p and q, it would be very hard to compute 
p(m). 
Next, we choose a positive integer e that is coprime with v(m); i.e., 
ged(e, p(m)) = 1. 
By (6.36) there exists a positive integer d such that 
(6.149) de = 1 mod y(m). 


Choose such an integer d. 


? This is the [ASCII code 
3Of course, we could choose another word, such as: MACGUFFIN 
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Definition 6.82. The |public key] is the pair of integers 
(6.150) (m, e). 

The private key is the triplet of integers 

(6.151) (p,q, d). 


The public key is used to encrypt an integer a and the private key is used to 
decrypt the resulting encryption. This is how it is done. We choose m so large 
that 


a<m 


for every positive integer a we wish to encrypt and decrypt. The encryption is given 
by defining 


(6.152) b:= r(a°), 


where r: Z — R,,, is the remainder function. The key is that we can recover a from 
b by the formula 


(6.153) a=r(b"). 
To see this, we observe that (6.152) implies that 
b=a*° mod m. 
Thus 
b¢ = a® mod m. 
By (6.149), there exists an integer k such that 
(6.154) de -—1=ky(m). 


On the other hand, by Euler’s Theorem |6.79} 


(6.155) abe) = 1" =1mod m. 
Therefore, 
(6.156) b¢ = a! =amodm. 


Since 0 < a < m, this implies (6.153). 


6.12*. The Euclid—Euler Theorem characterizing even perfect numbers 


As we have just seen, Euler’s totient function ¢ is very useful. In this section, using 
the |arithmetic function o defined in the next paragraph, we give a well-known 


characterization of even perfect numbers. 
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6.12.1. Sum of positive divisors function. The sum of positive divisors func- 
ltion| is defined for m € Z* by 


(6.157) o(m):=)—d, 
dl 


where the sum is over all positive divisors d of m. In general, a divisor function is 
an arithmetic function which depends on the divisors of the positive integer input. 


Firstly, the following result is immediate from the definition of o. 


Lemma 6.83. If p is a prime, since its only positive divisors are 1 and p, we then 
have 


(6.158) o(p)=p+1. 


A table of values of the sum of positive divisors function o for the first 50 

positive integers is 

+1 +2 43 44 +45 46 47 +8 +9 +410 
0; 1 3. «4 7 6 12 8 15 138 =~ «18 
10); 12 28 14 24 24 31 #18 39 20 = 42 
20| 32 36 24 60 31 42 40 56 30 = 72 
30} 32 63 48 54 48 91 38 60 56 90 
40| 42 96 44 84 78 72 48 124 57 ~ 98 


(6.159) 


For example, this table says that 
o(35) =14+54+7+35 = 48 


and that 
o(36) =14+2+34+44+649+412418+36= 91. 


Secondly, we have the following reduction formula for o of the product of co- 
prime integers. 


Lemma 6.84. Ifa and b are coprime positive integers, then 


(6.160) a(ab) = o(a)a(b). 
Proof. Let {c;}#_, = {c1,c2,...,¢,} be the set of positive divisors of a, and let 
{dj}ha1 = {d,,do,...,de} be the set of positive divisors of b. Then (exercise!), since 
a and 6b are coprime, the set of positive divisors of ab is equal to the set 
(6.161) (Cady exe eyes. 
From this we conclude that 
k ¢ k 

(6.162) o(ab) => eid; = ¥> 4; Yd; = (a) o(0). 

i=1 j=l i=1 j=l 


For example, 0(42) = o(6) 0(7) = 12-8 = 96. 
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By Lemmas and [6.84] we have the following. 


Corollary 6.85. [fm = pq is a semiprime, that is, p and q are distinct primes, 
then 


(6.163) a(m) =(p+1)(q+1)=m4+14+pta 
For example, 0(35) = 0(5- 7) = 35+1+5+7 = 48. 
Thirdly, generalizing Lemma [6.83] we have: 


Lemma 6.86. If q is a prime and if k is a positive integer, then 


k+1 
qd -1 
164 1 nee 
(6.164) a(g') = 
Proof. The positive divisors of q* are 
(6.165) 1, if Oca) Q"- 
Thus, 
, k : gt =] 
na ‘= ‘= =—__ 
(6.166) OD le 


For example, o(27) = 0(3?) sat 40. 
Fourthly and finally, from Lemmas|6.84] and [6.86] we have the following general 


formula for co. 


Corollary 6.87. Ifn = pi pk .-- pr is the prime factorization of an integer n > 2, 
then 
Pp kitl _ 1 
(6.167) sa = 
ei Pe= 


Example 6.88. Using Corollary [6.87] we compute that 
o(30) = o(2-3-5) =3-4-6=72, 
o(35) = 0(5- 7) =6-8 = 48, 
ead 33-1 


0(36) = 0(2°3") = 5 <= 


6.12.2. Statement and proof of the Euclid—Euler Theorem. Recall from 
that a positive integer n is a perfect number if n is equal to the sum 
of its positive divisors less than n. In terms of the sum of divisors function, this 
says that 


(6.168) n=o(n)—n; 
that is, we have: 
Lemma 6.89. A positive integer n is a perfect number if and only if 


(6.169) a(n) = 2n. 
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We have the following characterization of even perfect numbers. 


Theorem 6.90 (Euclid—Euler Theorem). An even positive integer n is a perfect 
number if and only if 
M,(M, + 1) 


(6.170) n=2?-'QP?-1)= 5 


where My := 2? — 1 is a Mersenne prime! 


Proof. (<) Suppose that 2? — 1 is a Mersenne prime. We then have that 
(6.171) gced(2?-*, 2? -1) =1 


since 1 < 2?-! < 2?—1. Thus, by LemmaJ6.84] we have the sum of positive divisors 
satisfies 


(6.172) o(2?-1(2? — 1)) = o(2?-") a (2? - 1). 

By Lemma |6.86] we have 

(6.173) o(2?-') = 2? -1, 

Since 2? — 1 is assumed to be prime, we have 

(6.174) o(2? — 1) = 2?. 

Therefore, by plugging (6.172) and (6.173) into (6.171), we obtain 
(6.175) o(2?-1(2? — 1)) = (2? —1)2? = 2. 2”-1(2? — 1). 


Thus, by the characterization (6.169), 2?~1(2? — 1) is a perfect number. 

(=) Suppose that n is an even perfect number. Then we may write n as 
(6.176) n= 2" sg 
where k > 1 and q is an odd integer (i.e., ged(q,2) = 1). Since 2" - q is a perfect 
number, by (6.169) and by Lemmas [6.84] and [6.86] we have 
(6.177) att. g = 2.98. g = o(2*- 4) = (21 — 1) 0(g), 
where we also used that gced(q, 2") = 1. Observe that since k > 1, we have 2*+1!—1 > 
3. Moreover, since 2*+! — 1 is odd and divides 2*+! - q from (6177), by Theorem 
[4.23] we have that 2"+! — 1 divides q. 

Now, since 2*+! — 1 > 1 is a positive divisor of q, we have that 


ee el) 
~ 9k+1 | Qktl 


(6.178) r: 
(where we used (6.177) for the second equality) is a divisor of q which is less than 
q. Hence, r and q are distinct positive divisors of g. This implies that 

(6.179) o(qg)>r+q, 


with equality if and only if r and q are the only divisors of g. Note that in this 
equality case we have r = 1 and q is prime. 


Now, (6178) and (6.179) imply 


(6.180) 2'+1y = o(g) >rtqart (Qt 1p = 2p, 
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This implies that o(q) = 7 +4, which in turn implies that 


(6.181) go Oh a 1)pa a1 
is a prime. We are done since we now have by (6.176) that 
(6.182) nao 1), 


Finally, we let p =k +1 to get n = 2?-1(2? — 1), where 2? — 1 is prime. 


Remark 6.91. Euclid proved the “if” implication direction of the theorem, whereas 
Euler proved the “only if” implication direction of the theorem. This is even though 
just about every mathematician between Euclid and Euler believed the “only if” 
implication to be true! 


6.13*. Twin prime pairs 


Recall that we encountered twin primes in More importantly, the twin prime 
conjecture that there are an infinite number of twin primes is still unsolved! We 
have the following result which we will not prove. © 


Theorem 6.92. A pair (m,m+2), where m > 2, is a twin prime pair if and only 
if 
(6.183) 4((m — 1)!+ 1) = —m mod m(m + 2). 


For example, for m = 3 we have 
(6.184) 4((3 — 1)! + 1) = 12 = —3 mod 3(3 + 2) = 15, 
corresponding to (3,5) being a twin prime pair. For m = 7 we have 
(6.185) 4((7 — 1)! + 1) = 2884 4 —7 mod 7(7 + 2) = 63 
since 2891 = 63 - 45 + 56, corresponding to (7,9) not being a twin prime pair. For 
m = 11 we have 
(6.186) A((11 — 1)! + 1) = 14515204 = —11 mod 11(11 + 2) = 143 
since 14515215 = 143 - 101505, corresponding to (11,13) being a twin prime pair. 


Factorials quickly get large. For example, the next smallest twin prime pair is 
(17,19). We compute that 


(6.187) 4(16! + 1) = 83691159552004 = —17 mod 323 
since 83691159552021 = 323 - 259105757127. 


6.14. Chameleons roaming around in a zoo 


We conclude this chapter with a fun puzzle regarding the interactions of social 
chameleons, which can be solved using modular arithmetic. 


Solved Problem 6.93. There are several chameleons roaming around in the 
Of these chameleons, Ro are red, Bo are blue, and Go are green, 
where all of Ro, Bo,Go are positive integers. Whenever two chameleons meet, if 
they are the same color, then their colors remain unchanged after the meeting. 
On the other hand, if they are different colors, then both of their colors change 
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to the third color. Are there any restrictions on Ro, Go, and Bo for it to be pos- 
sible that all of the chameleons are eventually the same color? (Assume that the 


Fountain of Youth is in the San Diego Zoo and so the chameleons never die; that 


is, they are immortal!) 


Figure 6.14.1. Indian chameleon From Kanakpura, Karnataka. Photo credit: 
Wikimedia Commons, author: Girish Gowda, licensed under Creative Com- 
mons Attribution-Share Alike 4.0 International (https://creativecommons. 
org/licenses/by-sa/4.0/deed.en) license. 


Solution. If two same colored chameleons meet, then there is no change to the 
colors due to this meeting. So we just need to consider the case where two different 
colored chameleons meet. 


Suppose that a red chameleon and a blue chameleon meet. Then both of these 
chameleons become green. So, if R,B,G is the number of red, blue, and green 
chameleons right before the meeting, respectively, then 


(6.188) R:=R-1, B:=B-1, G:=G+2 


is the number of red, blue, and green chameleons right after the meeting, respec- 
tively. The key observation is that 


(6.189) R-B=R-Bmod3, B-G=B-Gmod3, G-R=G-— Rmod3. 


Indeed, this is a simple calculation using (6.188). Note that a (somewhat) trivial 
consequence of this is 


(6.190) B-R=B-Rmod3, G-B=G-—Bmod3, R-G=R-G mod 3. 


The displayed congruence equations above are also true: (1) when a blue chameleon 
and a green chameleon meet and (2) when a green chameleon and a red chameleon 
meet. That is, they are true whenever two different colored chameleons meet. On 
the other hand, when two same colored chameleons meet, they are true trivially. 


Let Ro, Bo, Go denote the initial number of red, blue, and green chameleons, 
and let R;, B;,G; denote the number of red, blue, and green chameleons after the 
i-th meeting, where 1 <i < n and n is the total number of chameleon meetings 
(which can be any positive integer). By (6.189), we have that modulo 3, R; — Bi, 
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B;-—G;, and G; — R; are independent of 7. In particular, the initial differences and 
the end differences are congruent modulo 3: 

(6.191) 

Ry, — By = Ro— Bo mod 3, B,- Gn = Bo -— Go mod 3, Gn —-— Rn = Go — Ro mod 3. 


Now suppose that after the n-th meeting, all of the chameleons are one color. 
Firstly, assume that this color is red. Then B,, = G, = 0, and so (6.191) says that 


(6.192) Rn = Ro — Bo mod 3, 0 = Bo — Gp mod 3, —Ry = Go — Ro mod 3. 


Notice that any one of these equations follows from the other two. The only re- 
strictive equation we have is the middle equation, which says that 


(6.193) Bo = Go mod 3. 


Similarly, if all of the chameleons are blue after the n-th meeting, then we must 
have that 


(6.194) Go = Ro mod 3. 


And, if all of the chameleons are green after the n-th meeting, then we must have 
that 


(6.195) Ro = Bo mod 3. 


Thus, if Ro, Bo, Go are all incongruent modulo 3, then it is impossible for all of 
the chameleons to ever be the same color. Moreover, if the numbers of chameleons 
for two different colors are not congruent modulo 3, then the chameleons can never 
all be the third color. 


Conversely, suppose that (at least) two of Ro, Bo,Go are congruent modulo 3. 
Then, without loss of generality, we may assume that Gp = Rp mod 3. In addition, 
suppose that Gop > Ro. We will show in this case that it is possible to arrange a 
sequence of meetings of the chameleons so that eventually all of the chameleons are 
blue. By symmetry in green and red, it follows that we can remove the assumption 
that Go = Ro. 

We first arrange for Ro pairs of green and red chameleons to meet. After these 
Ro arranged meetings, we have that 


(6.196) R,=0, Gi=Go—Ro, Bi =Bo+2Ro. 


Case 1: k := $Gi < Bi; ie., Gi < 3B). Note that since Go = Ro mod 3, we 
have that k is a non-negative integer. Since k < 6), we can arrange for / meetings 
between green and blue chameleons. After these k arranged meetings, we have that 


(6.197) Ry =2k, Go=2k, Bo=B,—k. 


Then all we have to do is arrange for 2k pairs of red and green chameleons to meet 
to obtain 


(6.198) R,;=0, G3=0, Bz; =B,+3k=Ro+Bot+Go, 


at which point we have all blue chameleons! 
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Case 2: k:= aGy > By; i.e., Gi > 3B). In this case we can arrange for B, 
green and blue chameleons to meet. After these 6, arranged meetings, we have 
that 


(6.199) R,=2Bi, Go=G,—B,, B.=0. 


Since G'; > 3B,, we have G2 > Rz. So we can arrange for Re of the red and green 
chameleons to meet. After these R2 arranged meetings, we have that 


(6.200) R3 =0, G3 =Go—-—R2=G,-3B,, B3 = By +2Ro =4B. 


Now, let ¢ := $G3, which is an integer. If £ = }G3 < Bs, we can use the 
argument of Case 1 to obtain (in two steps) all blue chameleons! This condition is 
equivalent to G, < 15B,. So we have reduced the problem to the case G; > 15B,, 
as compared to the previous reduction to the case G, > 3B8,. Continuing in this 
way, we have a sequence of reductions of the problem to the case G; > M;P,, 
i > 1, where if the sequence of reductions is not finite, then M; > oo as i > oo 
(we do not attempt a rigorous justification of this). Since the ratio of G; to B, 
is bounded, eventually these reductions must halt, and so we must have all blue 
chameleons after a finite number of reductions! 


6.15. Hints and partial solutions for the exercises 


Hint for Exercise Two integers a and b are congruent modulo 2 if and only 
if r(a) = r(b), where r : Z + Rz is the remainder function. Consider the following 
two cases: (1) r(a) = r(b) = 0 and (2) r(a) = r(b) = 1. 


Hint for Exercise (1) One way to see this is to observe that 


(6.201) 10 = 99---9+1=9-11---1+1. 
—~_— SS” 
k9’s k1’s 
(2) We have 
k — . a ae 
(6.202) 10° = 3- 33---34+1, 
k3’s 


Hint for Exercise Since a > 0 and b < m, so that —b > —m, we have 
(6.203) a—b=a+(—-b) >0+(—m) =—m. 
Continue. 


Hint for Exercise[6.4]| Suppose that 2 = y mod m and that a is a positive divisor 
of m. Then m divides x — y. Use that b divides m. 


Hint for Exercise (1) 
(6.204) 1002, 1015, 1028, 1041, 1054. 


Hint for Exercise [6.6] (1) Use that 
(6.205) 456456 = 456 - 1001 = 0 mod 1001. 


(2) Use that 567567 is a multiple of 1001, which in turn is a multiple of 13. 
(3) Use that 579583 = 579579 + 4. 
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Hint for Exercise Suppose that a; = ag mod m and by = bg mod m. Then 
m divides a, — a2, and m divides b, — bg. Use that 


(6.206) (ay = bi) — (a2 — bz) => (ay = az) — (by = bg). 


Hint for Exercise A key implication in proving the inductive step is that if 
a, = bo mod m and if 


(6.207) a, t+ag+-+++ap = bd, +02 +--+ + by mod m, 
then 
(6.208) (ay tag +--+ + ax) + angi = (01 + 62 +--+ +d) + bg41 mod m. 


Hint for Exercise [6.9] Use the hint, Exercise [6.2(1), Theorem[6.9(3), and Exer- 
cise [6.8] 


Hint for Exercise Use the transitivity of congruence. 
Hint for Exercise The answer is given by 


x7/O 1 2 3 4 5 6 
0/0 00 0 0 0 0 
1/0 12 3 4 5 6 
2/0 2 4 6 1 3 °5 
3/0 3 62 5 1 4 
4/0 4 15 2 6 8 
5/0 5 3 16 4 2 
6/0 65 4 3 2 1 


Hint for Exercise [6.12] No. Explain why. 

Hint for Exercise See Table [4.1.1] 

Hint for Exercise [6.14} 10° = 1* mod 9. 

Hint for Exercise[6.15} An integer is odd if and only if it is congruent to 1 mod 2. 


Hint for Exercise 1000 = —1 mod 13. 


Hint for Exercise Suppose ab = 1 modm and ac = 1modm. Using 
gcd(a,m) = 1 (why is this true?), show that b = c mod m. 


Hint for Exercise [6.18] The solutions, except for the m = 7 case (which con- 
tributes 4 rows), is given by the following table: 


Hint for Exercise |6.19} The smallest such solution is 2 and the largest such 
solution is 14. 


Hint for Exercise [6.20} We have 
(6.209) (m — a)(m — b) = ab+ m(m—a-— 0). 
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Hint for Exercise The answers are (explain why): 


(1) by = by mod 5. 
(2) by = by mod 7. 
(3) by = bg mod 45. 


Hint for Exercise If a prime p divides ab, then p divides a or p divides b. 


Hint for Exercise We present the solutions in a table: 


Hint for Exercise The integers b are 3,6,9,12. We have the following table 
of solutions: 


Hint for Exercise The pairs of inverses modulo 13 are 
(6.210) (1,1), (2,7), (3,9), (4,10), (5,8), (6,11), (12, 12). 


Hint for Exercise [6.26] 


(=) Suppose that a has the property that there exists an integer b such that 
ab =1 mod ™m. Then there exists an integer q such that 


(6.211) ab—1=qm. 


Show from this that a and m are coprime. 


(<=) Suppose that a and m are coprime. By Theorem [4.20] there exist integers 
b and c such that 


(6.212) (b)a + (c)m =1. 
Continue. 
Hint for Exercise The pairs (a, 6) such that ab = 1 mod 18 are 


(1,1), (5,11), (7,13), (11,5), (18,7), (17,17). 


Hint for Exercise If pi, p2,..-, px are distinct primes, then gcd(p;,p;) = 1 
fori # j. 
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Hint for Exercise For m = 6, we have modulo 6 the following congruences: 


i 1, 
os A. 
a3, 
7 gee 
57=1. 


Hint for Exercise Since 36 = 1 mod 5, the congruence equation in this 
exercise is equivalent to 7? = 1 mod 5. The set of all positive solutions is 


(6.213) {14+5k:k eZ}. 


Hint for Exercise [6.31] For the first part, we just need to show that r(ay) 4 0, 
which is true if xy is not a multiple of m. 


Hint for Exercise Suppose that x is a quadratic residue modulo ab. Then 
© =Yaqp(y”) for some y € Z. Now use that rg org, =Pa- 


Hint for Exercise f is well-defined if and only if r(x?) 4 0 for x € QRin. 
That is, m does not divide x?. Consider the two types of m’s, and for each type 
use properties of division and primes. 


Hint for Exercise Example [6.55] and Exercise [6.29] calculate for each 3 < 
m <10 the set QR. All of these integers m are primes or products of two distinct 
primes except for m = 4,8,9. By inspection, we see that f is not a bijection for 
m = 5,10 and that f is a bijection for m = 3,6, 7. 

We also observe that f is well-defined and a bijection for m = 4,9 even though 
they are not the product of distinct primes. 


Hint for Exercise |6.35} The contrapositive of the statement we want to prove 
is: (c = y mod p) and (a = y mod q) imply (a = y mod pg). One can prove this 
implication using standard properties of division and the fact that p and gq are 
distinct primes. 


Hint for Exercise Suppose p divides y. Then y = 0 mod p and y? = 
0 mod p. Derive a contradiction from this and from the equations that x and y 
satisfy. 
Hint for Exercise (1) Equation (6.94) implies that p divides the product 
(az — 1)(az +1). Continue. 

(2) If ez = 1 mod p, then z is an inverse of x mod p. Why is this a contradic- 
tion? 
Hint for Exercise [6.38] (1) It is easy to show that p is odd. Obtain a contradic- 
tion if p< n. 

(2) Show that (n!)? = —1 mod p and hence (n!)?-! = (—1)"=~ mod p. 

(3) Show that n! is not a multiple of p, and then use Fermat’s Little Theorem. 


pl 


(4) Show that (—1)" = 1 and that this implies that ?5+ is even. 
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(5) Suppose for a contradiction that there are only a finite number of primes 
that are congruent to 1 modulo 4. Let po be the largest such prime. Let p be a 
prime divisor of (po!)? +1. Explain why p > po and p= 1 mod 4 and why this is a 
contradiction. 


Hint for Exercise |[6.39}| By the Fundamental Theorem of Arithmetic, m is a 
product of primes. Let q be a prime in this product. Then q divides m. Since m 
divides p*, we have that q divides p*. This implies that q = p. Since m is positive, 
this implies that m is a non-negative power of p, so we write m = p*, where ¢ € Z2. 
Now, since p* divides p*, we have £ < k. 


Hint for Exercise The set of integers between 1 and 27 that are coprime 
with 27 is 


(6.214) {1,2,4,5, 7,8, 10, 11, 13, 14, 16, 17, 19, 20, 22, 23, 25, 26}. 


Hint for Exercise By (6.134), we have 


(6.215) y(n) =n][ (1 = *) ; 


p|n 


Theorem says that if m,n € Zt satisfy gcd(m,n) = 1, then 


(6.216) p(mn) = p(m)p(n). 
By (6.121), we have 

ky kf _ 1 
(6.217) y(p") =p (1 -) 


We also recall from (1.75) that the prime factorizations of the integers between 
21 and 40 are given by 
3°77 211 28 25 fF fis es oy 29 2235 


6.218 
( ) 3l 2 S:11 2-17 S57 23" SF fie B61 25 


We compute that 


(25) = 25 = 20, 

(26) = y(13) = 12, 

y(27) = 27 = 18, 

(28) = p(277) = 2-6 = 12, 
(30) 
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Hint for Exercise Let R = {r1,172,---,(m)} be a reduced residue system 
modulo m. Consider the restriction of the remainder map: 


(6.219) r:R- Rm. 


By hypothesis, |R| = |R»,| = v(m). Moreover, since distinct elements of R are 
incongruent modulo m, we have that r is an injection. Therefore r is a bijection. 


Now, for each 1 < i < y(m), there exists an integer q; such that 


(6.220) y= r(r;) +qm. 
Therefore, 
(m) (m) 
= (r(ri) + Gm) 
i=1 i=1 
(m) (m) 
= r(ri) +m S- qi 
i=1 i=l 
y(m) 
= yy r+m > Gi 
rERm, gcd(rjm)=1 i=1 


The result now follows from the claim that 7 .cr,, gc d(rym)=1" 18 divisible by m. 


To see this, observe that for r € Rp, 
(6.221) gcd(r,m)=1 S— ged(-r,m)=1 S-) ged(m—r,m)=1. 


We also have that if m > 2 is even, then 4 is not coprime with m. Thus, the 
elements r € Rm satisfying gcd(r,m) = 1 may be paired as (r,m—r) for r < m/2. 
Since each such r satisfies r + (m—r) = m, we conclude that )/.eR,,, ged(rym)=17 
is divisible by m. 


Chapter 7 
Counting Finite Sets 
1 

a + b 
la? + 2Qab + 1b? 
la? + 3a2b + 3ab? + 1b3 
lat + 4a2b + 6a2b? + 4ab? + 1b4 
la® + = 5a‘%b + 10a°b? + 10a2b%> + 5abt = +) 10? 

la®& + G6a®b + 15a4b? + 20a°b? + 15a2b4+ + G6ab®> + = 10% 

etc. 


Figure 7.0.1. The Binomial Theorem visualized as Pascal’s triangle! 


Goals of this chapter: Discuss the basics of counting finite sets. Prove 
the inclusion-exclusion principle for counting finite unions of sets. To understand 
binomial coefficients and to prove the Binomial Theorem. To count functions. 


Question: Are |monsters] good at math? 


Answer: Not unless you Count Dracula. 


Recall from Definition that a set X is finite if there exists a non-negative 
integer n and a bijection from N, := {1,2,3,...,n} to X (where No := 0). Fur- 
thermore, we say that X has cardinality n and we write this as |X| =n. 


7.1. The 


A key question is: How does cardinality relate to set operations? The addition 
principle answers this for the unions of disjoint sets. 
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7.1.1. The addition principle for two sets. Suppose that we have two baskets 
X and Y, each containing mysterious objects. How do we count the total number 
in the baskets altogether? Simple! We add the number of objects in each of the 
two baskets together. 


Theorem 7.1 (The addition principle for two disjoint sets). Suppose that X and 
Y are disjoint finite sets. Then X UY is finite and 
(7.1) |X UY| =|X|+|Y]. 


That is, if X and Y are disjoint finite sets, then the cardinality of the union is the 
sum of the cardinality of the sets. 


Proof. See Figure[7.1.1] for a visualization of this proof. 


Let n = |X| and let k = |Y|. By the definition of cardinality, there exist 
bijections f : N, + X and g: N; — Y. To prove the addition principle, we need to 
define a function 

hi Nnap 7 XUY 


and show that it is a bijection. We “stack” the function g on top of the function f 
to define h. Namely, we define h by 


._f Fl) ifl<i<n, 
(7.2) mii ={ gi—n) ifnt1l<i<ntk. 


It is easy to see that h is well-defined. We now show that h is a bijection, from 
which the theorem follows. 


(1) Proof that h is a surjection. Let ze X UY. Then z € X orze Y 
(exclusive “or” since X and Y are disjoint). So we consider two cases: 


(la) If z € X, then there exists i €¢ N, such that f(z) = z since f is a 
surjection. Since 1 <i <n, we have 
h(i) = fl) =z. 
(1b) If z € Y, then there exists 7 € N, such that g(j) = z since g is a 
surjection. Since 1 <7 <k, we haven+1<j+n<n+k and 
AG+n)=9(G+n)—n) =9(5) =z. 
This proves h is a surjection. 


(2) Proof that h is an injection. Suppose that h(i1) = h(i2) © X UY. 
Then h (i,) = A (ig) € X or h(i,) = h (ig) € Y. So again we consider two cases: 
(2a) Suppose h (i,) = h(i2) € X. Then 1 < i1,i2 <n. Hence f (i,) = f (é2). 
Since f is an injection, 
bs Seo 
(2b) Suppose h(ii) = h(t2) € Y. Then n+1 < t1,t2 < n+k. Hence 
g (i, —n) = g (ig —n). Since g is an injection, i; — n = ig — n, so that 


41 = 1. 


This proves hf is an injection. 
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n+k 
n ae iL 
k 
f f 
g — —> 
Fs 5 
2 r2 2 
i i \a1/ i 
Nz Y Nn XxX Nntk 


Figure 7.1.1. The addition principle visualized. 


Example 7.2. If X is a set of nm numbers and Y is a set of k letters, then X UY 
has cardinality n+ k. 


Exercise 7.1. Let m be a positive integer, and let N», = {1,2,...,m}. Prove that 
if X and Y are subsets of N», satisfying |X| + |Y| > m, then 


(7.3) XNY #90. 
7.1.2. Thinking about the addition principle for two sets. Can you convince 


yourself that the idea of the statement and proof of the addition principle (Theorem 
is encoded in Figure 


Ty r2 v3 In Y1 Y2 ¥3 Uk 
a 2 3 nm nt1ln+2n4+3 n+k 


Figure 7.1.2. Another visualization of the proof of the addition principle for sets. 


7.1.3. The addition principle for n sets. From the addition principle for two 
sets and by mathematical induction, one can prove: 


Corollary 7.3 (Addition principle for n disjoint sets). If X1,X2,...,Xn are dis- 
joint finite sets, t.e., X; Xj = 0 for alll <i<j<n, then X1UXQU---UX, is 
a finite set and 


(7.4) |X, U XgU---UX,| = |Xi] + | Xo] +--+ + | Xn]. 


Proof. Base case. The addition principle for two disjoint sets says that this is true 
for n = 2. 
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Inductive step. Suppose k > 2 is such that for every disjoint finite set X1,..., Xz 
we have |X, U---U X,| = |X1|+---+|X;,|. Let Yi,...,¥,41 be disjoint finite sets. 
Since Y, U---UY;, and Yz41 are disjoint finite sets, by the base n = 2 case, we have 


|¥, U---UYp41| _ \((¥,U---UY,) U¥e41| 
=|Y,U---UY;| + |Ypa1]. 
Since Y;,..., Y,; are disjoint finite sets, by the inductive hypothesis, we have 
l¥, U---UY,| = [Yi] +--- + [¥e]- 
Combining the two displays above, we have 
[Yr U-+-U Yaga] = |¥i U-+-U Yel + [Yaga 
= ([¥il +--+ +[¥el) + [Yesal 
= |[Yil+--++|¥iqil- 


Conclusion. By induction we are done. 


Example 7.4. Let m be a positive integer, and let X be a finite set of integers. 
Define X; := {a € X : 2 =imodm} for1<i<m. Then 


7.2. Cartesian products and the multiplication principle 
Let n > 2 be an integer and let X1, X2,...,X»n be sets. Recall that their n-fold 
cartesian product is defined by (5.28) as 

X41 x XQ Res x Xn = {(£1, 2,..-,; En) 1 LG EX; for i= 1 2y2 00,0} 


Recall also, as in (5.29), that X” is the n-fold cartesian product where X; = X for 
each i = 1,2,...,n. For example, as in (5.31) Euclidean n-space R” is the n-fold 
cartesian product of R. 


Theorem 7.5 (Multiplication principle for two sets). If X and Y are finite sets, 
then 
(7.5) |X x Y| = |X]|-|Y|. 


Proof. See Figure for a visualization of the proof. 


Let n = |X| and m = |Y|. Then there exist bijections f : N, — X and 
g:Nm— Y. Define h: N, x Nm -— X x Y by 


h(y,k) =(fV),9(k)) for 7 EN, and k € Ny. 
Then h is a bijection (exercise). Define b: N, x Nn > Nam by 
(7.6) b(j,k) =j7+(k-1)n for 7 EN, and KE Nn. 


Then b is a bijection (exercise; observe that 6(j,1) = j, b(j,2) =7 +n, b(j,3) = 
j +2n, etc.). Hence b has an inverse c: Nnm — Ny, x N,, which is a bijection. 
Therefore the composition hoc: Num + X x Y is a bijection. 


In the proof above, we counted the elements of a cartesian product using the 
invariance of cardinality under bijections. We can also use the addition principle 
for sets, as in the following exercise. 
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1l+(m—1)n 2+(m—1)n 3+(m-—1)n mn 
142n 2+2n 3+4+2n 3n 
l+n 2+n 3+n 2n 

I 2 3 n 


Figure 7.2.1. A visual proof of the multiplication principle (Theorem [Z.5). 
We can think of this figure as representing the bijection b defined by (7.6). 


Exercise 7.2. Give another proof of Theorem [7.5} Let |X| =n and let 


(7.7) X = {21,2%2,...,2n}. 

Show that 

(7.8) foxx Y > Linpxy 
i=1 

defined by 

(7.9) f(x,y) = (wy) 


is a bijection. Apply the addition principle for counting finite sets. 


By induction, we can soup up the previous theorem to the following. 


Corollary 7.6 (Multiplication principle for n sets). If X1,X2,...,Xn are finite 
sets, where n > 2, then 


(7.10) |Xy x XQ xX +++ xX Xp] = |Xa|-|Xea]--++ + |XnI- 


Proof. Base case. Theorem[7.5|says this is true for n = 2. 


Inductive step. Suppose k > 2 is such that for any collection of finite sets 
Xi, arate Xk we have |X4 x Xo xr & X;,| = |X1| . |X| ie [Xx]. Let Yi,. es Yeti 
be finite sets. Since Y; x --- x Y;, and Yx41 are finite sets, by the base n = 2 case, 
we have 

[¥y x +++ & Yogi] = |(¥% x +++ x Ye) x Yeqs| 
=|Y1 x +++ x Yel x [Yeral. 


Since Y;,..., Y,; are finite sets, by the inductive hypothesis, we have 


l¥y x +++ & ¥,| =]¥i] x +++ x [¥ eI - 
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Combining the two displays above, we have 
[Yr x +++ & Yeqgal = [Yi x +++ x Yel x [¥ega 
= (\¥i| x +++ x [Yel) x [¥e+a| 
= [Yi | xr & lYn+1| : 


Conclusion. By induction we are done. 


If we take all of the X; to be equal to a single set X, then we obtain: 
Corollary 7.7. If X is a finite set andn > 1, then 
(7.11) [Xx”] = |X|”. 
Example 7.8. Let n be a positive integer. Then 
[Os 1" | =o, 
This formula is related to power sets, which we will consider in {7.4] below. 


7.3. The jinclusion-exclusion principle 


In this section we discuss the formula for the cardinality of the union of finitely many 
finite sets, called the inclusion-exclusion principle. This generalizes the addition 
principle in {7.i]to the case where the sets may intersect. 


Recall that, given two sets X and Y, their set difference is defined to be 
(7.12) X-Y={r#eEex|c¢ VY}. 
Theorem 7.9. For any sets X and Y, we have 
(7.13) XUY =(X -Y)U(Y-X)U(XNY), 
where the right side is a disjoint union of sets 
Exercise 7.3. Without using Venn diagrams, prove Theorem 
Example 7.10. Let k and n be positive integers. Let X = {0,1,2,...,n—1,n}, 


and let Y = {—k, -k +1,...,—1,0}. Then 
X—Y = {1,2,...,n—1,n}, 
Y —X = {-k,-k+1,...,—-1}, 
XY = {0}, 


and the disjoint union of these three sets is equal to X UY. 


7.3.1. Inclusion-exclusion principle for two finite sets. Using the theorem 
above, the following result about the cardinality of the union of two finite sets 
is easy to prove from the additional principle for sets. It says that if you count 
the things in a set X and then add the count of things in a set Y, you will have 
counted everything. But, you will have counted the things in both sets twice. So 
just subtract the number of things in the intersection. See Figure [7.3.2] for the case 
of 3 sets. Can you draw the analogous picture for 2 sets X and Y? 
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Theorem 7.11 (Inclusion-exclusion principle for two sets). If X and Y are finite 
sets, then 


(7.14) |IXUY|=|X|+ |Y|-|XNY]. 
Proof. You may wish to draw Venn diagrams to picture the proof; cf. Figure 5.1.3) 
By Theorem and Corollary 

|IXUY|=|X —Y|+/¥ —X|+|XNY]. 


We wish to express the right side more in terms of |X| and |Y|. To this end, since 
X =(X —Y)U(XNY) is a disjoint union, we have 


X|=|X —Y|+|XNY|. 

Similarly, 

Y|=|Y—X|+|XNY|. 

Hence |X — Y| = |X|—|X MY] and |Y — X| = |Y|— |X MY]. We conclude that 
|XUY|=|X -—Y|+|/¥-X|+|XnY| 

=|X/+/¥|=|ANY|. 


7.3.2. Inclusion-exclusion principle for three finite sets. Using the same es- 
sential idea as for two sets, we can obtain a formula for the cardinality of the union 
of three sets. 


Theorem 7.12 (Inclusion-exclusion principle for three sets). If X, Y, and Z are 
finite sets, then 


(7.15) |X UYUZ|=|X|+|Y|4|Z|-|XNY|-|XnZ|-|YnZ|+|XnYnZ|. 


We can use to visualize the inclusion-exclusion principle. A 


Venn diagram for counting the union of three finite sets is given in Figure [7.3.1 


xX Y 


Z 
Figure 7.3.1. Venn diagram for the union of sets X,Y, Z. 


A schematic diagram for using the inclusion-exclusion principle for counting the 
union of three sets is given in Figure |X|+ |¥|+ |Z] overcounts. A correction 
is subtract: |XNY|+|XNZ|+|Y¥NZ|. But then this undercounts by: |XNYN Z|. 
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Z 


Figure 7.3.2. Counting overlaps in the Venn diagram for the union X UY U Z. 


xX Y 


Z 


Figure 7.3.3. After subtracting the cardinalities of the double intersections 
XNY, XMZ, and YNZ, we undercount by the triple intersection X NY NZ. 


Exercise 7.4. Of the 202 dinosaurs at| Jurassic Park 
(1) 125 eat apples, 131 eat bananas, and 136 eat cherries, 


(2) 81 eat apples and bananas, 84 eat apples and cherries, and 88 eat bananas and 
cherries, 
(3) 41 eat all three fruits. 


Using the inclusion-exclusion principle, find how many dinosaurs at Jurassic Park 
eat none of the three fruits. 


Exercise 7.5. For the previous exercise, write out the formula for how many di- 
nosaurs at Jurassic Park eat none of the three fruits assuming the following: 

(1) There are d dinosaurs at Jurassic Park. 

(2) a dinosaurs eat apples, b dinosaurs eat bananas, and c dinosaurs eat cherries. 


(3) ab dinosaurs eat apples and bananas, ac dinosaurs eat apples and cherries, 
and bc dinosaurs eat bananas and cherries. 


(4) abc dinosaurs eat all three fruits. 


Here, ab, ac, bc, and abc are variable names and not the products of a,b,c! 
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The readers may wish to prove for themselves the inclusion-exclusion principle 
for three sets in a similar spirit as the case for two sets. We will prove the general 
case of n sets below. 


7.3.3. Inclusion-exclusion principle for a finite collection of finite sets. Here, 
we will find it convenient to introduce a notation to describe intersections of sub- 
collections of the collection of finite sets. Let X,,...,Xx be a collection of finite 
sets, where & is a positive integer. Given a set of 7 distinct indices 
(7.16) I = {ty,i2,...,t;} CN, = {1,2,...,k}, 
where 1 <j <k, define 
(7.17) Xp =[) Xi =X XQ: 0X 

ier 


igi 


That is, the set N; is the set of indices of our collection of finite sets. A subset I 
of Nx with |Z| = 7 corresponds to a choice of j sets of this collection. The set X; 
denotes the intersection of these 7 sets. To see this more concretely, we consider 
the case k = 3: 


Example 7.13. For each non-empty subset J C Nz, we write out the set X;. Note 
that the number of such sets X; is equal to 2? — 1 = 7. 


1) P= {1}, Xp =X. 

2) I= {2}, Xp = Xp. 

3) I = {3}, X; = X3. 

4) I= {1,2}, Xr = X10 Xp. 

5) I= {1,3}, Xp = Xi N.X3. 

6) I = {2,3}, Xp = X20 Xz. 

T) F=10 93), Spe Ry Mo 


Theorem 7.14 (Inclusion-exclusion principle for n sets). Let X1, X2,...,Xn be a 
collection of n finite sets. Then 
(7.18) |JXyU X2U+-UXnl= So (-dl xy]. 

OAICNn 
Example 7.15. For n = 3, we write out the sum on the right-hand side of (2.18) 
as 


So (ayer = [2] + [Xa] + |X| 
OAICNs — |X, X_| — |X X3| — |X2n Xz] 
+ |X1N Xo X3i. 


Observe that the first line has the terms with |I| = 1, the second line has the terms 
with |Z| = 2, and the third line has the terms with |J| = 3. From this, we see that 
when n = 3, Theorem reduces to Theorem [7.12 


Exercise 7.6. Count the overlaps (numbering 1 through 4) for the Venn diagram 
of 4 sets in Figure Your picture should be analogous to the Venn diagram in 
Figure If you prefer a more symmetric picture, see Figure [7.3.5 
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Figure 7.3.4. A Venn diagram for 4 sets! Can you count overlaps as in Figure[7.3.2? 


Figure 7.3.5. A more symmetric Venn diagram for 4 sets X1, 2, X3, and 
X4, each represented by an isosceles triangle. See “Venn Symmetry and Prime 
Numbers: A Seductive Proof Revisited” by Stan Wagon and Peter Webb, in 
American Mathematical Monthly, August 2008, pp. 645-648. 


7.3.4. A conceptual proof of the inclusion-exclusion principle. Why the 
inclusion-exclusion principle is true. Here is the intuition for why Theorem 
[7.14]is true. Consider the sum 


(7.19) [Xa] + [Xa] tes + LA. 


This sum multiply counts certain elements of the union |X,UX2U---UX,,|. On the 
other hand, all of the elements outside of the union of the double intersections are 
singly counted correctly. So we add to the sum in (7.19) the non-positive correction 
term 


(7.20) — SO [Xin Xj). 


l<i<j<n 


We claim that with this correction term we are correctly counting the elements 
outside of the union of the triple intersections. Indeed, for elements in the union 
of the double intersections, but outside the union of the triple intersections, we 
are correctly removing exactly 1 from the double counting to get the correct single 
counting of these elements. On the other hand, for the elements in the union of 
the triple intersections, we have overcorrected and have hence undercounted these 
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elements. We thus next add the non-negative correction term 
(7.21) + So [XN Xi Xi. 
1<i<j<k<n 
Continuing this way, we see that the inclusion-exclusion formula should be an al- 


ternating sum exactly as it is written on the right-hand side of (718). 


Did you glimpse at what lies ahead? For the impatient reader who just wants 
the verbose author to get to the point, you may proceed directly from here to 47.3.6} 
below for a quick and dirty proof before reading the following detailed proof! 


7.3.5. The full proof of the inclusion-exclusion principle. 


Proof of Theorem Inductive step and base case. Inductive hypothesis: 
Assume that k > 2 is an integer with the following property (the inclusion-exclusion 


principle for k sets): For any finite sets Y;,...,¥,, we have 
(7.22) Yiu UK) = So (-1)¥I-4 Vy]. 
OAICNy, 


Note that by Theorem [7.11] k = 2 has this property and hence the base case is 
true. 


Proof of the inductive conclusion: Let X1,...,Xp41 be finite sets. Of course, 
we have 
X1,U++-UXp41 = (X1U-++-U Xp) U Xp41. 
By the inclusion-exclusion principle for two sets and by the inductive hypoth- 
esis, we have that 
(7.23) 
|[X,U-+-UXp41| = |(X,U-+-UX_%) U Xe44| 
= |X, U++-UX;,| + |Xp41| — |(X,U---U Xp) AX e441 
= S* Cyl xy] + | Xegil —|(X1 UU Xe) 0 Xgl. 
OAICN, 


To prove the inductive conclusion, that is, the inclusion-exclusion principle for k+1 
sets, we need to rewrite the right-hand side of the display above. To this end: 


Claim 1. 


(7.24) (X1U-+U Xe) A Xpg1 = (X10 XK41) UU (XE XK 41), 
and hence 

(7.25) \(X1U++-U Xe) O Xegi| =|¥1U--- Us|, 

where 

(7.26) Y; = XiN Xeys. 


Proof of Claim 1. Let P(k) be the statement that for every k + 1 sets 
Ce Cee. oe 


(7.27) (X1U-+-U Xp) A Xo = (X19 Xo) ++ U (XR Xo). 
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It is easy to see that by calling X,41 to be Xo, Claim 1 is equivalent to the assertion 
that P(k) is true for all k € Z*. 


Base case for Claim 1: k = 1. In this case the statement is that (X1) Xo = 
(X19 Xo), which is obvious. 


Inductive step for Claim 1. Suppose that k > 1 is such that for every k+1 
sets Yo, Y1,.-.-,Yz, 


(Y,U---UY,) AVY = (YN Yo) U-+-U (¥, Yo). 


We want to prove the next case. So let Xo, X1,...,X4%, Xp41 be (kK +1) +1=k+2 
sets. Then, by the distributive law and by the inductive hypothesis, we have 


(X,U+--U Xp, U Xp41) A X09 = X1U-: -U Xp) UXK41) Xo 


(( 

= ((X1U Pee (Xn419 Xo) 

= ((X1N Xo) U-++U (XM Xo)) U (Xe41M Xo) 
SOG PU Xe ere 


This proves the desired inductive conclusion. Therefore this completes both the 
inductive step and the proof of Claim 1 by induction. 


Now, by applying Claim 1 to (723), we have 


(7.28) |X, U+--U Xe] = S> (1) | X7| + |Xeqi| — |[¥iU---U Yi. 
OAICNyE 


As for the intersections of X’s, define for the intersections of Y’s the notation 


(7.29) Yr =()¥i = YAY. ANY), 
jed 


for subsets J = {j1,..., Je} C Ne. 

Claim 2. If JC Ng, then 
(7.30) Yy = XO Xeg = X supers: 
Proof of Claim 2. Let J = {j1,..., je} C Ny. Then 


Yr = ¥,0-°- V5 
= (Xj, 1 Xeg1) 1+ 1 (Xj, Xe41) 
= (Xj,N-+-N Xj) Xeq1 
= XIN X41 


= X Ju{k+1}; 


where the last equality is true since J and {k + 1} are disjoint. 
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Now, by (2.28), the inductive hypothesis, and Claim 1, we obtain 


(7.31) 
|X, U+--U Xp] = > (a0 ee a he SS (=p, 
= SS) 1)" X14 |Xeult SS (HD Xue! 
OAICNE OAICNe 
= Pa (-1)""" x7] + SS (-1)""|Xjugesayls 
OAICNy JICNe 
where for the last equality we used that Xgu,%41} = Xk+1- Regarding one of the 
exponents of (—1), notice that |J| = |J U{k+1}|—1. We are getting close to 


completing the proof of the inductive step. 
Claim 3. If0 AI C Nps, then exactly one of the following holds: 
Case (i): 0A I CNg. 
Case (ii): [= JU{k+ 1}, where J C Ng. 


k+1 k+1[] {k+1} k+1[] {k+ 1} 
Nx Ni Nz || J 
£ 
I#0 OU{k +1} ={k+1} JU{k+1}, J#0 
Case (i) Case (ii) 


Figure 7.3.6. Picturing schematically the two types of non-empty subsets of Nz41. 


Proof of Claim 3. Let § 4 I C Nx41. Then Case (i) corresponds tok +1 ¢ J, 
and Case (ii) corresponds to k +1 € I. This proves Claim 3. 


Now, by applying Claim 3 to (731), we have 
SS EDP xr] + Xevl+ SS AD Xsupaagl= SS (14-4 pg. 
OATCNyE OAICN:E OATCNe4+1 


The inductive conclusion follows. 


By induction, we conclude that Theorem [7.14]is true. 


7.3.6. A quick and dirty proof of the inclusion-exclusion principle. Less is 
more! To understand proofs better, it is convenient (but sometimes frustrating for 
the reader!) to skip steps. Having or not having read the detailed proof above, 
can you fill in the details of the following inductive step in the proof of the general 
inclusion-exclusion principle? 

By the inclusion-exclusion principle for two sets and by Claim 1, we have 


(7.32) |X, U-+-U Xp41] — |X1 U---U Xz | = |Xz41| — |Y¥; U---UY,|, 
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where Y; = X; 1 Xx41. So the inductive step is true provided 
(7.33) YS DM  Xa| = |Xeral -— [K++ U al 
ICNe+1,I1ZNe 
This formula is equivalent to 
(7.34) S- (=1)!"|X juresay| =—|Y,U---UY,|. 
OAITCN:E 
Since X yurx41} = Yz, this in turn is equivalent to 
(7.35) dD CUP ys] = MUU Ye, 
OAICNy, 
which is true by the inductive hypothesis! 
Exercise 7.7. Now that you know the idea of the proof of the inclusion-exclusion 
principle: 
(1) Prove the n = 3 case of the inclusion-exclusion principle directly. 


(2) Prove the n = 4 case of the inclusion-exclusion principle directly. 


7.4. Binomial coefficients and the Binomial Theorem 
Did you hear about the mathematician who couldn’t afford to eat out? 
They could binomials. 


Suppose you are among four mathematicians (including yourself) eating at a 
restaurant. How many possibilities are there for choosing two of you to play a game 
of tennis? Binomial coefficients answer this question. If we further ask to list all 
the possible groups of two players, then we are led to the power set of the four 
mathematicians. 


7.4.1. The power set of a set. Let X be aset. The|power set|of X, denoted 
by P(X), is the set of all subsets of X. An element A of P(X), denoted by 
A€é P(X), is the same as a subset A of X, denoted by AC X. To wit, 


AEP(X) & ACcCX. 
We are most interested in the case where X is the set 
Ng = 4 2 0s th {ke Zt : Leeent. 

This is because N,, is the prototypical (model case) finite set. Of course, |N,| =n. 
Example 7.16 (Power sets of N; for k < 4). 

(1) The power set of Nj is 

P (Ni) = {0, {1}}. 
(2) The power set of No is 
P (Na) = {0, {1} {2}, {1, 2}}. 
(3) The power set of N3 is (for a visualization, see Figure [7.4.1) 


P (Ns) = {9, {1}, {2}, {3}, {2,3}, {1,3}, {1, 2}, {1, 2, 3}}. 
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Figure 7.4.1. Visualizing the power set P(N3). 


(4) The power set of N4 is the set whose elements are 
0, 
{1}, {2}, {3}, {4}, 
{1,2}, {1,3}, {1.4}, {2,3}, {2,4}, {3,4}, 
{1,2,3}, {1,2, 4}, {1,3,4}, {2,3, 4}, 
{1, 2,3, 4}. 


Exercise 7.8 (Properties of power sets). 

(1) Explain why 0 € P(X) for every set X, where P(X) is the power set of X. 

(2) Suppose that AC B. Prove that P(A) C P(B). 

(3) Let U be the universal set. Suppose that A and B are sets. Explain why 

P(ANB) Cc P(A) NP(B). 
Exercise 7.9. Define f :P(Z) x P(Z) > P(Z) by 
f(A, B) =AUB. 
Let E be the set of even integers and let O be the set of odd integers. Find all 
(A, B) € P(Z) x P(Z) satisfying 
f(A, B)=Z, ACE, and BcoO. 


7.4.2. Binomial coefficients. Given an integer k > 0, let P(X) be the set of 
k-element subsets of X; that is, 


Pr(X) ={AE P(X): |A] =k}. 


We have 
Pr(Nn) = {Ae P(N,) : |Al =k}. 
In the British television series the protagonist is Number Six. 
Suppose that ten residents of the Village, including the prisoner, are numbered 


1 through 10/4] Then P4(Nio) can be interpreted as the possible groups of four 
residents chosen to play the card game Bridge. 


1However, Number One has never been seen, and Number Two, the Village administrator, con- 
stantly changes. 
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Example 7.17. P2(N,) is the set of 2-element subsets of the set {1,2,3,4}; its 
elements are 

{1,2}, {1,3}, {1,4}, {2,3}, {2,4}, {3,4}. 
We see that each element of P2(N4) corresponds to a choice of 2 distinct elements 
out of the 4 elements of Ny. 


For a visualization of this, see Figure [7.4.2] 


A e e e 
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Figure 7.4.2. Visualizing P2(Na). 


Since P;(N,,) is empty if k < 0 or k > n, we shall assume that 0 < k < n. 
Since every subset of N,, has between 0 and n elements, inclusive, we have 


P(Nn) —= | | Px(Nn), 
k=0 


where the “disjoint union” symbol | | indicates that the sets P,(N,,) that we are 
taking the union of are disjoint; i-e., 


(7.36) Pr(Nn)AP(Nn) =O for k Fé. 
Effectively, we have the power set of N, into the 0-element subsets, 
the 1-element subsets, the 2-element subsets, ..., up to the n-element subsets. 


Example 7.18. We have 
P(Na) = Po(Na) U Pi (Ng) U Po(Na) U P3(N4) U Pa (Nz), 
where the right side is a disjoint union. 


This all may seem somewhat complicated, but we greatly simplify things by 
only concerning ourselves with counting, i.e., the cardinality of finite sets. Define 


the |binomial coefficient 
n 
= N, 
(;,) = [Peta 


to be the number of (how many) k-element subsets of N,,. Now, each element of 
Px(N,,) corresponds to a choice of k distinct elements out of the n elements of N,,. 


For this reason, the binomial coefficient @ is also called n choose k. 


Fact: The power set of a finite set is a finite set (we will prove this in the 
next section). Hence, each P;(N,,) is a finite set and hence each @ is a (finite) 


non-negative integer. 
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Example 7.19. By Example[7.17| we have 


(5) = iPattayl =6. 


By the addition principle for sets, we have 
[P(Nn)| = D> IPe(Nn)l = 52 (7). 
k 
k=0 k=0 


One may think of a particular subset of N, as corresponding to a particular set of 
n choices of whether to put each of the elements 1,2,..., in the subset. Since for 
each element of N,, we have two choices, in or out, we obtain 


IP(Nn)| = 2" 


(see also (7.109) below). Therefore, we have the binomial coefficient summation 
identity 


(7.37) 3 (1) =o, 


k=0 
ag 
E le 7.20. =1+4 4+1=16=2'. 
xample i) +4+6+4+ 6 


Binomial coefficients have the following symmetry property. 


Lemma 7.21. [fn is a non-negative integer and if k is an integer satisfying 0 < 
k<n, then 


vs ()=(2.) 


Proof. Define the “complement function” 


(7.39) e:PeINn) > Pa—x(Na) 
by 
(7.40) c(A) = A®. 


Its inverse function is given by (exercise!) 


(7.41) c': Pn—k(Nn) + Pe(Nn) 
where 
(7.42) c(aSes 


Hence, c is a bijection. 
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{3,4,5} 
{2,4,5} 
{2,3, 5} 
{2,3, 4} 
(1, 4,5} 
{1,3, 5} 
{1,3,4} 
{1,2, 5} 
(1; 2a) 
11 e% 


@\/@e;e@e;el;e;e;e;ele,e 
@\/@e;e@e;@e;e@e;e;e;e;e,e 
@e/@e;e@e;@el;e@e;el;ei;eleie 


5 5 
Figure 7.4.3. Visualizing the (;) = CG case of Lemmaf[7.21] The 3-element 
sets consist of red dots and the 2-element complementary sets consist of blue 
dots. 


15 15 
Example 7.22. i) = (?) = 105. 


Thinking about it, this all makes sense. The binomial coefficient (1) can be 


viewed as the number of ways, given n mice, to choose k of them to be members 


of ‘The Mickey Mouse Club, Now, the number of ways of forming such k-member 


mice clubs (consisting of the hoity-toity cool mice) is the same as the number 
of ways of choosing which n — k mice will not be in the club (the down-and-out 
nerdy mice). The point is that choosing which mice are “cool” by exclusion is the 
same as choosing which mice are “nerdy”. For example, suppose we have 3 mice 


named Moe, Curly, and Larry, Let us look at the number of ways of choosing 2 


cool mice to be in the club. This is the same as choosing 3 — 2 = 1 nerdy mouse 
3 
not to be in the club. There are (;) = 3 ways to do this: (1) choose Moe to be 


nerdy, (2) choose Curly to be nerdy, or (3) choose Larry to be nerdy. These three 
choices are in one-to-one correspondence with the three choices: (1) choose Curly 
and Larry to be cool, (2) choose Moe and Larry to be cool, or (3) Choose Moe and 
Curly to be cool. 


Exercise 7.10. Let m be an odd positive integer. Let 

m m 
Ga) Xe {A € P(Nm) : |Al < =}, Y:= {A € P(N): |Al > =}. 
Prove that 
(7.44) |X| = |Y]. 


7.4.3. Relations between binomial coefficients. We have the following|/recur-| 
lsive formula for binomial coefficients (for the application to Pascal’s triangle, see 
{7.4.4] below). 


Theorem 7.23. For positive integers n andr such thatl<r<n, 


ea ee) 
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5lelelelele ele] e ele ° 
4} elelele e lee ele ° ° 
Y 3/elele. el ele e |e el]. . 
Q|e]e elele elel|e elele 
1|e eo|elele efelelelelfe 
5 . rt . s.| 
4 ° ° ° ° ° 
X 3/0 ° ° ° efe 
2 ° ° e fel] e 
1 ° e}ele]e 
Figure 7.4.4. For Exercise[7.10] the sets X and Y when m = 5. Each column 
of the lower and upper grid represents an element of X and Y, respectively. For 


example, the lower rightmost column represents the subset {4,5} and upper 
rightmost column represents the subset {1,2,3}. If a subset A represents a 
lower column, then its complement A® = Ns — A represents the column directly 
above it. 


Proof. For a visualization of this proof, see Figure [7.4.5] 
For an r-element subset A of N,,, 
(1) call A Type-In ifn € A, 
(2) call A Type-Out ifn ¢ A. 


rf 

e e e | e e e 

e e e : e e e 

e e e : e e e 

e e e 1 @ e e 

ey es ES eS ee ee . S 
ID 19 19 19 19 19'% SYS SH + 
N Oo SF wo HS HIN N O 
ee 
BO I 
ees | SS 
Stn = P2(Na) ' Sout = P3(Na) 
—_—_— 


Figure 7.4.5. Visualization of the proof of Theorem[7.23]for n = 5 and r = 3. 


From this picture we conclude that (;) = () = (;): 
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Clearly, every A € P,(N,,) is either Type-In or Type-Out. We partition the set 
P,(N,) by defining the disjoint subsets: 
Sin = {A € Pr(Nn) |n € A}, 
Sout = {A € Py (Nn) |n ¢ A}. 
Then P,.(N,) = Sin U Sout is a disjoint union of finite sets, so by the addition 
principle we have 


(7.46) |P-(Nn)| = [Stal + |Souel - 
Claim. We have: 
(i) |Stn| = |Pr—1(Nn-1)]- 
(ii) |Sout| = |Pr(Nn-1)|- 


The theorem follows from the claim since the claim implies 
n n—-1 n—-1 
(") = [Ps(0n)] = Prana] -+ Po (Smad = ("7 1) + ("2"), 


Proof of the claim. 


Practical proof of (i). Elements of St, are those r-element subsets of N,, 
which contain the element n. By removing the element n from these subsets, this 
corresponds to arbitrary (r — 1)-element subsets of N,,1. Thus 


(7.47) |Stn| = |Pr—1(Nn-1)| : 
Formal proof of (i). We define 

Ff; Stn — Pr—i(Nn-1) 
by F(A) = A—{n}. Clearly F(A) C Nn-i. Since n € A, F(A) U {n} = A, 
so by the addition principle |F'(A)| +1 =r. Thus |F'(A)| = r—1 and hence 
F(A) € P,-1(Nn_1). This shows that the function F' is well-defined. Conversely, 
define 

G: Pr—1(Nn-1) = Stn 
by G(B) = BU {n}. Since B is an (r —1)-element subset of N,-1, BU {n} is 
an r-element subset of N,,; so G is well-defined. We now show that F' and G are 
inverses of each other. 


Let B € P,_1(Nn_1). Then F (G(B)) = F(BU {n}) = (BU {n}) —{n} = B 
with the last equality true since n ¢ B, which is true since B C N,-1. 

Let A € Sj,. Then n € A. We have G(F(A)) = G(A-— {n}) = (A—{n})U 
{n} = A with the last equality true since n € A. 


Practical proof of (ii). Elements of Sj, are those r-element subsets of N,, 
which do not contain the element n. This corresponds to arbitrary r-element subsets 
of Nn_-1- Thus 


(7.48) |Sout| = |Pr(Nn-1)] - 


We leave the formal proof of (ii) as an exercise. The theorem follows. 


Exercise 7.11. Give a formal proof of claim (ii) in the proof of Theorem [7.23] 
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Suppose we have a basket with m objects and another basket with n objects. 
How can we describe the possible choices of a total of k objects from both baskets? 
The following gives us an answer. 


Exercise 7.12. Let m and n be positive integers, and let0 <k < m+n be an 
integer. Given disjoint finite sets X and Y with |X| =m and |Y|=n, define 


‘ 

(7.49) f :Px(X UY) > J P(X) x Pr_i(¥) 
1=0 

by 

(7.50) f(C) =(CNX,CNY). 


(For example, we may take X = N,, and Y = {m+1,m+2,...,m+n}.) Also 
define 


k 

(7.51) g: U Pi(X) x Pr_i(Y) + Pa(X UY) 
i=0 

by 

(7.52) g(A, B) = AUB. 


(1) Explain why the functions f and g are well-defined. In particular, why are the 
function values contained in the codomain as advertised? 


(2) Prove that the functions f and g are inverses of each other. Did your proof use 
that X and Y are finite sets? Did your proof use that X and Y are disjoint? 


(3) Show that one can conclude that 


(7.53) ws ") - ("') (.",) . 


x 
xX 
SS ——— ee YS 
Pi(X) x Pal(Y) P2(X) x P3(Y) P3(X) x P2(Y) 


Figure 7.4.6. For Exercise[7.12] suppose that |X| = 3 and |Y| = 4. We have 
Ps(X UY) & an Pi(X) x P5_i(Y), where © means “is bijective to”. Note 
that Figure [7.4.10] below corresponds to the 10th column from the left in this 
figure. Can you explain how this figure is related to (773)? 


330 7. Counting Finite Sets 


7.4.4. Pascal’s triangle. Pascal’s triangle is 


dl 
1 1 
1 2 il 
1 3 3 1 
il 4 6 4 i 
1 5 10 10 5 ik 
1 6 15 20 15 6 i 
1 re 21 35 35 21 7 1 

etc. 


As you may see, an integer in the triangle is the sum of the two integers above it 
20 

35 : 

Remarkably, Pascal’s triangle is made up of binomial coefficients as we see by 


and closest to it. For example 35 = 15 + 20 occurs in 


4 4 
Note that (1) (3) represents 


va 0-00 


7.4.5. Pascal's triangle and Fibonacci numbers. In Figure we saw that 
the first several diagonal sums give Fibonacci numbers {f,}°2,. Let us try to 
formulate this as a mathematical statement and prove this statement in general. 
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From Figure we see that 


0 
—! —s 1 
(anon 
1 
(5) fo ’ 
2 a) 
()()=-4 
0 0 
(7.55) 3 5 
(0) +G)-a=s 
4 2 a 
(0) + (1) +(:)-#=» 
5 4 3 
(0) +) + G)=#=8 
etc. 
Exercise 7.13. Define ne 0 ifm <0, 5 <0, ors > m. Prove that the 
8 


recursive formula of Theorem [7.23] hold for all integers r and n. 
The following generalizes (7.55). 
Exercise 7.14. Let fy, denote the m-th Fibonacci number. Prove that for n € Z2, 


(7.56) fu= >) (‘). 


jtk=n J 


k 
where in the sum we may assume that k > 7 > 0 since otherwise ( ‘) = 0. 
Jj 


7.4.6. Modular Pascal triangles. The first eleven rows of Pascal’s triangle are 
given by Figure [7.4.7] 

We can consider the remainders modulo 3 of the binomial coefficients to form 
the “mod 3 Pascal triangle”. See Figure [7.4.8 

We will leave it to the reader to investigate what patterns there are for modular 
Pascal triangles! 


7.4.7. Binomial coefficient factorial formula. We have the following|factorial| 
|formulal for the binomial coefficients. 


Theorem 7.24. For non-negative integers n and r such that r <n, 


ee nl 


is 


(7.57) 


r! ri(n—r)t 


Proof (less formal, more intuitive). The binomial (") counts the number of 
r 


unordered r-element subsets of a set of cardinality n, such as N,. Such a subset 
can be obtained by first choosing r elements of N,,. There are n choices for the first 
element. Given any choice of the first element, there are always n — 1 choices for 
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ows of Pascal’s triangle. 


Figure 7.4.7. The first eleven r 


Figure 7.4.8. Pascal’s triangle modulo 3. Can you fill in the last two blank 


rows and beyond?! 


the second element, which has to be distinct from the first element (independent of 
what the choice was for the first element). Then, given any choices for the first two 


distinct elements, there are always n—2 choices for the third element. We continue 
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in this way until we have chosen the r-th element, for which there are n — r+ 1 
choices. Thus there are 


(7.58) n(n — 1)(n—2)-+-(n-r+1) 


possibilities for ordered r-element subsets of N,,. Since we are counting unordered 
r-element subsets of N,, and since there are r! permutations of any set of cardinality 
r, we have that there are 
n(n — 1)(n—2)---(n—r4+1) 
r! 
_ n= 1) 2) (n= 1) (nn) =P a Te 2a 
r!-(n—r)(n—r—1)---2-1 


n! 


n 
unordered r-element subsets of N,,, and this number by definition is equal to ( ) ‘ 
r 


Proof. The proof is by induction. Let n > 0 and let P(n) be the statement that 


foralO<r<n 
n\ n! 
r) ri(n—ry 


The base case is n = 0. One assumes P(k) for some k > 0. To prove P(k+ 1), 
one uses the identity from Theorem [7.23] (take n = k + 1) 


es. = (A) +() 


and one uses the inductive hypothesis to calculate the right side in terms of facto- 
rials. Here one verifies the formula for 0 <r < k+ 1. Separately, one checks that 
the formula in the theorem is true for r = 0 and for r =k +1. 


Observe that Lemma [7.21] follows immediately from Theorem [7.24] 


Example 7.25 (Special values of binomial coefficients). For n > 0, 


ca ()-()- 


For n> 1, 


(7.60) (") = (," ) =n. 


For n > 2, 


(7.61) (5) = (,"4) = eae 


Exercise 7.15 (A binomial identity). Prove that 


(7.62) (7) = a(t a, 
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Exercise 7.16 (Property of a prime choose k). Let p be a prime and let 1<k< 
p—1. Prove that the binomial coefficient (?) is divisible by p. 


7 
For example, the binomial coefficients a 1<k <6, are 7, 21,35, 35, 21,7, 
where each of these integers is divisible by 7. 


Exercise 7.17 (Another binomial identity). Prove that 


(7.63) > #(4) =n}, 


7.4.8. Proof of the Binomial Theorem. Monomials of one variable are non- 
negative integer powers of a single variable, denoted such as a”, where Z2. More 
interestingly, we can consider (powers of) binomials of the form (a+ b)”. The 
following result explains the terminology “binomial coefficients” . 


Theorem 7.26 (Binomial Theorem). For all real numbers a and b and non- 


negative integers n, we have the expansion 
n n 
n_ n—-ipi 
(7.64) (a +b) => (Je b 
i=0 
Sar pert (T)antB tb  ba 
a 


For example, 


(7.65a) (a +b)? = a? + 2ab +B’, 

(7.65b) (a + b)? = a? + 3a7b + 3ab? + 0%, 

(7.65c) (a + b)* = a4 + 4a°b + 6a7b? + 4ab? + b4, 

(7.65d) (a+ b)> =a° +5a*b + 10a°b? + 10a7b* + 5ab* + b°. 


Proof (less formal, more conceptual). We have 


(7.66) (a +b)" = (a+b)(a+b)(a+b)--- (a +6). 


The right-hand side is the sum of products 
(7.67) C1 €2€3°** Cn, 


where each c;, 1 <i <n, is either a or b. By the commutativity of multiplication, 
we may write each such term as a”~*b’, for some 0 < i < n, since in view of 
(n — 1) +% = n such binomials are the products of n numbers, each of which is 
either an a or a b. Now, given 0 < i < n, the number of terms of the form a”—*pe 
is equal to the number of ways to choose c;, c2¢3-++Cp, to consist of 7 b’s and n — 1% 


sag n 
a’s. Thus, the number of terms of the form a”~‘b’ is equal to { . ]. The Binomial 
a 


Theorem follows. 
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b a®b Fee sh ee +(,* Jaro + (Harm ae eed veh (Pash + peti 


a aktl + (jaro oh, Se (, k )akeeiye +(j Jato + ae Ad ee ie 


b k k; k k 
Ne chi de k4+2—-ipi-2 k+1-ipi-1 k-ipi k-1-ipi+] k 
a” =P eee b +(," Ja b +(i)a oF (0, )< b Biles ay; 


Figure 7.4.9. Visualizing the inductive step of the proof of the Binomial 
Theorem [7.26] Using Theorem we can infer the formula to prove the 


6 k eae k+1 eo 
inductive step (a + b) ty (| )ak-i0i = was ( : Jak+t—ipi from ‘this 
figure. 


We now start thinking about a formal proof of the Binomial Theorem. Regard- 
ing the terms in Figure[7.4.9] by Theorem [7.23] we have 


(7.68) ab (‘) a*b = C * ‘) a*b, (Task +ab* = (" - ‘ath 
i 1 1 1 
(7.69) ‘ k )abeeiye + (, k he = (; 0 ee, 
t—2 s— 1 i—1 


(7.70) (, k 1) ghti—igi 4 (‘) ghti-ipi = (* + ‘) aktli-ape 
= 7 7 


etc. 


Proof (by induction). If you like to think about proofs visually, you may refer 
to Figure [7.4.9] as you read this proof. 


We leave it to the reader to check the base case n = 1. By induction assume 
that k > 0 is such that 
k 


(a+b)*= > (‘) ait, 


i=0 
We compute 


(a+b)**" = (a+b)* (a +b) 


a*~'b' (a+b) (inductive hypothesis) 


k 
_ G gers S- (‘) abit! (distributive law). 
i=0 
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We want the exponents in a’~*b'+! above to match up with the exponents in 


alk+1)—*b? for the k +1 case of the Binomial Theorem. So we define 
=i, 


so that summing i from 0 to k is the same as summing 7’ from 1 to k + 1. We get 
that the last line in the display above is 


k 7h RAL pp - 
= S- ( ‘ ger —tpf 4 a & jaa b’ (writing in terms of 7’ instead of 7) 
oS i oe 
k k+1 
— S- gy as S- ; al®+))—*h! (renaming i’ to be #) 
aan 8 =o 
0 k k 
= (k+L)—ipi (k+1)-ipi Gy 
=) ("a b » (He y+ (,* Je b 
i=0 i=l i=l 
k+1 k 
(k+1)—ipi 
+ pS (, 7 ie b 
i=k+1 


k 
k k k a 
= (k+1)—0}0 S (k+1)—ipi 


he (, ki janes 


k+1)-1 
K+1\ (441)-070 a go ea k+1 7 ; 
= b (k+1)—ipi (k+1)—(k+1) pk+1 
( 0 Ja oe , }% bY + r41)% 
k+1 
k+1 : — 
— as ( ; Jalota 


i=0 


where the penultimate equality is by Theorem [7.23] and because 


(== (7) (een—a)=G-1=G), 


Analogous to Pascal’s triangle in {7.4.4 the expansions in the Binomial The- 
orem can be visualized as Figure [7.0.1] at the beginning of this chapter (compare 
with (7.65)). 

It is interesting to consider the Binomial Theorem in modular arithmetic. Es- 
pecially useful is the case where the modulus is a prime. 


Exercise 7.18 (Binomial Theorem modulo a prime). Let p be a prime, and let 
x,y € Z. Prove that 


(7.71) (x+y)? =x? +y? mod p. 
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Hint: Use Exercise Some “prime” rows of Pascal’s triangle are 


1331 
15101051 
1 7 21 35 35 21 7 1 
1 11 55 165 330 462 462 330 165 55 11 1 


If we count the 1 at the top of Pascal’s triangle as the 0-th row, then the display 
above pictures rows 3, 5, 7, and 11. 


Exercise 7.19 (Proof Fermat’s Little Theorem using the Binomial Theorem). We 
give another proof of Fermat’s Little Theorem [6.63] Let p be a prime and let a be 
a positive integer. 

(1) Prove that (a +1)? =a? +1 mod p. 

(2) Prove by induction on a that a? = a mod p for all positive integers a. 

(3) Assuming in addition that a is not a multiple of p, prove that a?~+ = 1 mod p. 
Exercise 7.20 (More binomial identities). Let k,@ € Z=. We have the obvious 
identity 

(l+a)*? = (1+2)¥(1+2)*. 
By applying the Binomial Theorem to each of the binomials (1+ x)*t+*, (1 +2), 


and (1+2)* and by matching the resulting coefficients of x™, whereO<m<k+4, 
for both sides of the identity, prove the following formula. ForO<m<k-+8, 


min{k,m} 
k+eé k £ 
i=max{m—£,0} 


For example, if k = 3, €=4, andm=5, this says that 


(7.73) 6 = ()) (i . ) 


Indeed, this is true because 
21=3+4+12+6. 


Show, as a special case, that we have the formula 


e 7 (') ; 
nj} 2 i} 
1=0 
7.4.9. Leibniz’s rule for higher-order derivatives of products. We now give a 


calculus application. Using essentially the same proof as for the Binomial Theorem, 
we can prove a product formula for the higher derivatives of one-variable functions. 


Theorem 7.27. Let f and g be one-variable functions that have derivatives to all 
orders (such functions are called smooth). For alln € Zt, 


n 


Gg" => @ fg, 


k=0 
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Proof. A proof of this formula is at the general Leibniz rule, Wikipedia page. The 


proof we give here is essentially the same. 
We prove the theorem by induction. 
The base case, n = 1, is true because 


1 


(fg) = (fo)! =fo't+f'g= (5) fugs-O4 ()) foyga-9 => (me 


by the usual Leibniz rule. 
Inductive step. Suppose for some m € Zt that 
rs “ (m ae? 
(ayer => (2) Fo k), 
k=0 


We then compute that 
(fg) = aay 


k 
k=0 
=> 1 Gf fern) gh k) dg fh) gim k+1)) gape (fF) = f(t dite 
k=0 
= (im (k+1) gi k) ~ (k)g (m+1—k) 
= (E) ernie + (Fs 
k=0 k=0 


At this point, we need to decide how to proceed to obtain the formula in the desired 
form. We observe that the term with fg("+!-), which is the second term on 
the right side of the last displayed line, is of the desired form but has the wrong 


coefficient (‘"). We rewrite the first term on the right side of the last displayed 
line to help us. Namely, by letting k = @— 1, we have 


m m+1 m+1 
MY e(k+1) (m—k) _ m (2) (m+1-£) _ m (k) (m+1—k) 
eae S- (7, ) rato, 


k=0 é=1 k=1 
where the last equality is by relabelling @ as k. Thus 
m+1 i 
(m+1) _ (k) .(m+1—k) Gor k) 
(fg) ar ht \e 


The formula we desire is a sum from k = 0 tok = m+ 1. We have a mismatch 
of the summation limits in the two sums above. So we separate the special cases 


7.4. Binomial coefficients and the Binomial Theorem 339 


k =m-+1 and k =0, which each only appear in one sum, to get 


m “ m m+1—k m m m m 
(fa)! ey (he #1-k) (™\x #1) g(0) 4 (7) £0! +1) 


k=1 
o> (TY p(k) glm+1-k) 
+>) i ) rg 
k=1 
= (") FO gD 4 5~ 
k=1 
m m 
ne (”) pom g(0) 


m+1 a m m . 
= (0) ,(m+1) (k) .(m+1—k) 
("parame (GT) +) 


mt+1 m 
+( ae +1) g(0) 


m = “(im eae 
(." :) fF) glm+1—-k) +>" ("£9 +1-k) 
k= 


1 


m- 


m+1 i “(m+ Ae ee ae m+1 i 
=( ‘ ) 19 4 9>( A ) 10 +1 + ( \x +1) g() 


m+1 
k=1 


m+1 


m+1 m+1— 
=->{ ; 19 Hk), 
k=0 


where for the penultimate equality we used the recursive formula of Theorem [7.23] 
This proves the inductive step and hence, by induction, completes the proof. 


Remark 7.28. The interested readers (or should we say the bored readers? | may 
read on their own about multinomial coefficients and the|multinomial theorem 


7.4.10. More relations between binomial coefficients. 
Solved Problem 7.29. Let X and Y be disjoint finite sets with |X| = 3 and 
|Y| = 4. Can you formulate an approach to compute 5 in terms of choosing 


pairs of subsets of X and Y whose unions have exactly five elements? 


Solution. Imagine a subset A of X of cardinality i and a subset B of Y of 
cardinality 7. By the addition principle, the cardinality of AU B equals 5 if and 
only if¢+ 7 = 5. In other words, if i := |A|, then |B| = 5 — 7. Now, |A] < |X| < 3, 
and |B| < |Y| < 4. Thus, we have the restrictions i < 3 and 5 —i < 4, which are 
equivalent to 1 <i< 3. 


Since X and Y are disjoint, by Exercise [7.12] there is a one-to-one correspon- 
dence between cardinality-5 subsets C of X UY and pairs (A, B), where A C X and 
B CY, with |A| =7, |B] = 5-7, and 1 <i <3. That is, there exists a bijection 
between P5(X UY) and the disjoint union 


(7.74) (Pi(X) x Pa(Y)) U (Po(X) x P3(¥)) U (P3(X) x P2(Y)). 


2Interested in something more; bored with something less. © 
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From this and the addition principle, we obtain the formula 


(7.75) (7) = |P5(X UY)| 
= | (Pa(X) x Pa(¥)) U (P2(X) x Ps(¥)) U (P(X) x Pal¥))| 
= |Pi(X) x Pa(¥)| + [P2(X) x Pa(¥)] + |Ps(X) x Pal¥)| 
= |Pi(X)|-|Pa(¥)| + [P2(X)|- [Ps] + [Ps(X)| - [P| 


-()Q+G)6)+@)G) 


Observe that, indeed, both sides of the equation above equal 21. 

We can imagine the formula in the following way. We can think of X UY as 
having 3 red rooms and 4 blue rooms, where red and blue correspond to X and Y, 
respectively. A cardinality-5 subset of people in X UY consists of i people in X 
and 5 —i people in Y, where 1 <i < 3. See Figure [7.4.10] 


lea | Le La 


Figure 7.4.10. An example of a cardinality-5 subset with 2 people in X and 
3 people in Y is pictured. 


Solved Problem 7.30. Prove the binomial coefficient identity: 


em GT )-ER- 0+ Cr +02) ++@). 


Solution. Consider the power set Pxii(Nn+1), whose cardinality is equal 


an 
Sk, Sk4+1,Sk4+2-+-,Sn, where 


(7.77) Si = {A © Prai(Nn4i) : max A =i+ 1}. 


Indeed, since the maximum of a subset of N,,+1 is unique and since a subset having 
cardinality k + 1 implies that its maximum is at least k + 1, we have that 


n+1 
to ( _ by definition. We partition the set Pxii(Nn+1) into disjoint subsets 


(7.78) Proi(Nnai) = LU Si = Sk USk41 USky2 UU USp 
i=k 
is a disjoint union. Thus, by the addition principle for sets, 


n 


(7.79) & : i) =~ Sil. 


i=k 
For each k <i <n, we define the function 
(7.80) F;: S; > Px (Ni) 
by 
(7.81) P(A) =A-—{i+]}. 
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The function F; is a bijection with inverse equal to the function G; : P,(N;) > S; 
defined by G;(B) = BU {i+ 1}. Thus, 


a 
(7.82) [Sil = [Pu = (5). 
The result follows. 
e aa pei cans: 
S3 S4 Ss 


Figure 7.4.11. The power set P4(Ng) is partitioned into the disjoint union 
3 4 5 
S3 US4USs5. We have |S3| (5) , [Sal (3) _ and |S5| (3) 0 


Solved Problem 7.31. Prove the identity 


eal 


(7.83) f= >> (oS ‘) 


i=0 


al ha a hee Te 


for alln € Zt, where fy, denotes the n-th Fibonacci number and |-| is the floor 
function. 


Solution. Let F(n) be the statement that formula (7.83) is true. Let P(k) 
be the statement that F'(2k — 1) and F(2k) are both true. We prove the result by 
induction on k. 


Base case: k = 1. We need to establish P(1) and P(2). Firstly, P(1) is the 


0 
—i 
statement that f; = ) ( : i; which is true since both sides of the equation equal 
i 
= 
‘ 0 . 
1-i ae : 
1. Secondly, P(2) is the statement that fo = y ( ), which is also true since 
i 
i=0 


again both sides of the equation equal 1. 


Strong inductive step: Suppose that k € Z* is such that P(k) is true; that is, 
F'(2k — 1) and F(2k) are both true. Now, 


(7.84) AS] = = | =k-1. 


2 2 
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Thus, our aforestated inductive hypothesis says that 


k-1 
2k —i-2 
(7.85a) for-1 = ( . ) 
=r 
1 


k-— . 
(7.85b) i= @ a ‘), 
=0 


We compute that 


fori = for-1 + for 


1=0 1=0 
k k-1 
~ (2k 1 ~ (2k 
= »( : - ) ( : ) (replaced i by 1 — 1 in the first sum) 
i=1 i=0 


_ 2+y (eas ‘ @ — 2) . 


By applying Theorem [7.23] we may rewrite this as 


as Fee > € ‘ z > we :) 


i=l i=0 


This proves F'(2k + 1). 
It remains to prove F(2k + 2). We compute that 


forte = fox + fort 


ee (eS ) 
4 >> aaa 
=) ge wana 


This proves F'(2k + 2). Thus, the proof of P(k +1) is complete. By induction, we 
are done. 
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Remark 7.32. Let 

ae 
7.87 — : 
(7.87) mim on 


We can illustrate the inductive step by considering the calculations corresponding 
to the special case k = 3. Namely, we show that g7 = g5 + ge and gg = ge + g7. 
That is, at this stage, the sequence {g,,} satisfies the same recursion relations as 
the Fibonacci sequence {f,}. By (287), we have 


o-()-0)- 
n= ()e()) 


roe) (C=) 
-()+()+(9+0) 


= 97; 


where the last equality is by (7.87). Similarly, we can show that gg = go + g7. We 
leave this as an exercise for the reader. Since g1 = gg = 1 = fi = fo and since 
the sequences {g,,} and {f;,} satisfy the same recursion relations, we conclude that 
they are equal as sequences. 


Solved Problem 7.33. Prove that for every n € Z2, 
(7.88) -  \ok = 3", 
k 


Solution. Define the set 
(7.89) X :={(A,B): ACBCN,}. 
Firstly, we compute the cardinality of X as follows. We partition the set X by 
the subsets defined by 
(7.90) X,:= {(A,B) eX: |B) =k} ={(A,B) eX: BEP;(N,)} 
for 0 <k <n. Indeed, we have that 


n 


(7.91) X = (J Xp = XoUX1UXQU++-UXp 
k=0 

is a disjoint union. For each B € P;,(N,,), let 

(7.92) X® := {(A, B): AC B}. 


Then the function F? : X? — P(B) defined by F?(A) := (A, B) is a bijection. 
Therefore, 


(7.93) |\xX?| = |P(B)| =27! =2". 
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On the other hand, we have that 
(7.94) a= | a 
BEP;. (Nn) 


is a disjoint union (exercise). Hence, by the addition principle for sets, we have 


(7.95) el So ikei= Se 2 = [Pun 2* = (71) 2 


BEPx (Nn) BEPx (Nn) 


Therefore, 
n n n 
(7.96) l= Ss eal= (;) OF 
k=0 k=0 


Secondly, we can compute the cardinality of X as follows. Each pair (A, B) © X 
is in one-to-one correspondence with a sequence of n choices, where the 7-th choice 
is whether: 

(1) i€ A (and hence i € B), 
(2) 7€ B-A, 
(3) i € B (and hence i ¢ A). 
Observe that these three choices are mutually exclusive since 
N, = AU(B-— A)U(N, — B) 


is a disjoint union. Since for each 1 < 7 < n there are exactly 3 choices, there are a 
total of 3” choices. Since there is a one-to-one correspondence between sequences 
of n such choices and elements (A, B) € X, we conclude that |X| = 3”. 


The result follows since the two ways of counting the cardinality of X must 
yield the same number. 


Figure 7.4.12. The trichotomy of choices R.B.G. 


7.5. Counting functions 


Instead of|counting stars, we count functions. Given two non-empty sets K and Y, 
the set of all functions from K to Y is denoted by Fun(K,Y). That is, 


(7.97) Fun(K,Y) := {f|f:K — Y isa function}. 
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Figure 7.4.13. For n = 3 the partition of X into Xo U Xi U X2U X3. We 
3 3 3 
O01, |X 21 x 2? = 12, and |X3| = 
have |Xo| = (5)2° = 1, |Xil = (5)2" = 6, [Xa] = (5) , and |X| 


(;) 23 = 8, 


7.5.1. Counting all functions. The main result of this subsection is the fol- 
lowing funky theorem (funky theorems come from Funkytown). 


Theorem 7.34. Let K and Y be non-empty finite sets. Then Fun(K,Y) is a finite 
set and 


(7.98) | Fun(K,Y)| = |y|/*!. 


The reason why the theorem is true is that if the domain of a function has 
cardinality & and the codomain of the function has cardinality n, then a function is 
given by a sequence of k choices, where for each choice there are n possible choices. 
Since the choices are independent, the total number of possible sequences of choices 
is equal to 


(7.99) nen na=nk, 
cae eee 
k times 


Here is the actual proof: 


Proof. Since K is a non-empty finite set, we may express it as 
(7.100) K = {x1,%2,..., 2x}, 


where k = |K|. We define a bijection between Fun(K, Y) and Y* as follows. Define 
the function B : Fun(K,Y) > Y* by 


(7.101) B(f) = (F(#1), F(@2), ++) f(we)). 
And define 

(7.102) C:Y* + Fm(K,Y) 

by 


(7.103) C(yi,y2,---;Yr) = the function g defined by g(x;) = y; for 1 <i<k. 


We leave it to you to check that B and C are inverses of each other, and hence they 
are bijections. 


The theorem is proven since by Corollary [7.7] we have that 
(7.104) |Fun(K,Y)| = |Y*| =|Y|F = [yA 


Exercise 7.21. Prove that the functions B and C in the proof of Theorem [7.34 
are inverses of each other. 
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7.5.2. The power set. Firstly, we observe that by taking Y = {0,1} in Theo- 
rem [7.34| we obtain the following. 


Corollary 7.35. If X is a finite set, then Fun(X, {0,1}) is a finite set and 
(7.105) | Fun(X, {0,1})| = 2!*1, 
Let X be a set, and let A be a subset of X. The characteristic function of 
A is defined by 
1 ifaeA, 
oenne) xa(z) = { 0 ifa@d A. 


Define the functions ® : Fun(X, {0, 1} P(X) by 


= 
(7.107) O(f) =f"), 
the pre-image of 1 under f. Here W: an — Fun(X, {0,1}) by 
(7.108) W(A) := 
the characteristic function of A. 


Lemma 7.36. The functions ® and WV are inverses of each other and hence are 
bijections. Since Fun(X, {0,1}) is finite, this implies that P(X) is finite and 


(7.109) |[P(X)| = | Fun(X, {0, 1})| = 2/*1. 
Exercise 7.22. Prove Lemma [7.36] 


7.5.3. Counting injective functions. Let m,n be positive integers. Define the 
set of functions 


(7.110) F:={f|f:Nm— Np is a function}. 
Now let m < n be positive integers. Define the subset of injective functions 
(7.111) I :=Inj(Nm,Nn):={f: Nm N,|f is an injection}. 


By thinking of each injective function from N,, to N, as corresponding to a 
way of lining up m people out of a group of n people, it is easy to see the following. 


Theorem 7.37. The number of injective functions from Nm», to Nn, t.e., the car- 
dinality of I, is equal to 


(7.112) Z| = n(n —1)(n—2)---(n—-m4 1). 


Proof. We forgo giving a rigorous proof in lieu of a more intuitive argument. We 
sketch the rigorous proof in Exercise We can count the number of injective 
functions as follows. Firstly, there are n choices for f(1). Secondly, given any 
choice of f(1), there are n — 1 choices for f(2) such that f(2) 4 f(1), which is 
necessary for f to be an injection. Thirdly, given any such choices of f(1), f(2), 
there are n — 2 choices for f(3) such that f(3) ¢ {f(1), f(2)}, which is necessary 
for f to be an injection|'} We continue in this fashion until we have made choices 
for f(1), f(2), f(3),..., f(m), where all of the values of f are distinct so that f is 
an injection. In particular, m-thly, there are n —m +1 choices for f(m). The total 


3This is independent of which choice you made for f(1), and similarly for the subsequent choices. 
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number of different sets of m choices, with each set of choices corresponding to a 
different injection, is equal to the product of m integers: 


(7.113) n(n — 1)(n— 2)-+-(n—m-+1). 


Table 7.5.1. Counting the number of injections from {1, 2,3} to {a,b,c, d}. 
There are 4 choices for f(1), there are 3 choices for f(2), and there are 2 choices 
for f(3). In total, there are 24 = 4-3-2 injections. The shaded set of choices 
represents the function f defined by f(1) =c, f(2) =d, and f(3) =a. 


f a b fs d 
b c d a c d a b d a b c 
3}cld}b|d}blele|dla;ldja}lc|b|djaj/d}| albjb\|ecelalslelaj)b 


Exercise 7.23. In this exercise, we give a rigorous proof of Theorem [7.37] by 
induction. 


(1) (Base case) Prove Theorem in the case where the cardinality of the do- 
main ism = 1. 


(2) (Inductive step) Suppose that m € Z* has the property that the number of 
injections from Nm to Ny is equal to n(n — 1)(n — 2)---(n-—m-+1) for all 
positive integers n. Prove that this formula is true for m replaced by m+ 1. 

Hint: Justify that we have the disjoint union 


(7.114) TnjNasi, Nn) = |) 
j=l 
where 
(7.115) Ty = {f €Inj(Nmii,Nn): f(m +1) = 5}. 


Show that the cardinality of each set I; is equal to the cardinality of the set 
Inj(Nin, Ny-1)- 


See Figure for a visualization of the injections from N3 to N4. 


4 . . 
3] - ele . . . olelele |e 

N 

- dQ} - |e]. gf sae cee | Pa) lf-ven is |e . a: || . 
i) |e & | a | we . . 
SS a 
qi, In Iz I, 

Figure 7.5.1. We visualize injections f from N3 to Ny by coloring f(3) green, 


f(1) red, and f(2) blue. 


If the cardinality of the domain and codomain are equal, then we immediately 
obtain the following consequence of Theorem [7.37] 
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Corollary 7.38. The number of injective functions from Ny, to Ny, is equal to 
(7.116) Inj(N,,N,) = n! 


Since by Theorem the set of injections equals the set of surjections, which in 
turn equals the set of bijections, this same count holds for the number of surjections 
and the number of bijections. 


Recall that a bijection of a finite set is also called a permutation. 


7.5.4. Counting surjective functions. Now let m > n be positive integers. 
Define the subset of surjective functions 


(7.117) S:={f:Nm—7N,|/f is a surjection}. 

The set of non-surjections N is 

(7.118) N:=F-S. 

We can think of S as the set of ways one can give n monkeys a total of m numbered 


bananas, where each monkey gets at least one banana. 


We can write N as a union of sets of functions as follows. Define 
(7.119) Aj :={f:Nn > N, |i ¢ Im(f)} 


for 1<i<n. Then 
(7.120) N=|J)A. 
t=1 


Indeed, if f € A; for some i, then f is not a surjection, so f € N. On the other 
hand, if f € N, then f is not a surjection, so that there exists an 7 such that 
i ¢ Im(f); that is, f € Aj. 

Since all of the sets we considered above are subsets of the finite set Fun(X, Y), 
these subsets are finite sets. Therefore we can use the inclusion-exclusion principle 
to compute the cardinality of N. 


Given a non-empty subset I of N,,, define 


(7.121) A; :={f:Nn > N,|IN Im(f) = 9}. 

Observe that for each 7, 

(7.122) Hep ode 

Also observe that if I = {t;,i2,...,%%} C Nn, then 

(7.123) en rena eee ae a eee 
ie] j=l 

Evermore liking to make observations, we further observe that 

(7.124) A; = Fun(Nn, Nn — J). 

Thus, 


(7.125) |Az| = (n — |Z|)”. 
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Now we may apply the inclusion-exclusion principle (Theorem[7.14) to obtain 
that the cardinality of the set of non-surjections is 


(7.126) IN=|UAd= So (play). 
i=1 OAICNy 
By applying (7.125) to this, we have 
(7.127) IN}= SO am =|)”. 
OAICNn 


Furthermore, given 0 < k < n, there are 4 subsets of order k in N,. That is, 
there are & many I’s with |I| =k. We conclude that 


(7.128) |N| = ee @ (n—k)™ = ype @ (n—k)™. 


k=1 
Thus, the cardinality of the set of surjections is 


(7.129) |S| =|F|—|N| =n D-ye(?) (n—k)™. 
k=1 


Recall that if m =n, then a surjection is a bijection. The number of bijections 
to N,, to itself is n!. Thus, as a special case, we obtain 


(7.130) nl=n™ + S71)" @ (n—k)”. 
k=1 


Example 7.39. We list the surjections from N3 to N2 as follows. Given a surjection 
f, we list its images f(1), f(2), f(3) in Table [7.5.2] which defines f. 


Table 7.5.2. There are six surjections from N3 to No. 


Ff) | FQ) | £R) 
i |) | 2 
1 | 2 ]1 
i | Sei] 3 
2) 1 ]1 
2 | 2 | 2 
2) 2 | 


This agrees with (2.129), which says that the number of surjections is equal to 


(7.131) 2° — a (2-1)? =8-2=6. 


Let {7,7} = {a,b} as (unordered) sets. Then the sets defined analogously to 
(119) given by 
(7.132) A, ={f:Ns3 —- {a,b} : f(k) = 7 for all k Ee Nz} 
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Table 7.5.3. The surjections from N3 to {a,b}. The shaded cells represent 
the surjection f defined by f(1) = 6, f(2) =a, and f(3) =b. 


1 a b 
2\/a b a b 
3/b]/alblal b 


are each singleton sets consisting solely of the constant function with image {7}, 
for i € {a,b}. We count by (7126) that |N| = |A,| + |A,| = 2. So, again we see 
that the set of surjections from N3 to {a, b} has cardinality 

(7.133) |S| = | Fun(Ns, {a, b})| — |N| = 8-2 =6. 

Exercise 7.24. Similarly to Table [7.5.2] list the surjections from Ny to No and 
verify that the number of surjections agrees with formula (7.129). 


7.6. Counting problems 
In this section we count the number of ways to sum a fixed number of integers to 
a given integer. 
Example 7.40. Let n be a positive integer. How many integer solutions are there 
to the equation 
(7.134) rty=n 
with « >0 and y= 0? 

The answer to this question is easy in the sense that we can “list” the solutions: 
(7.135) (0,7), (1,n— 1), (2,n — 2), ..., (m,0). 
So we see that there are n + 1 solutions. 


If you are not convinced that there are exactly n +1 solutions, we can make 
the following more formal argument. Let S be the set of solutions, whose elements 
are listed above. Note that $ Cc Zt x Z*. Define the function 


(7.136) r:S 32+ 
by 
(7.137) (x,y) =a. 


(This function is the[restrictionlof a/projection| function.) Then 7 maps SJ bijectively 
to the subset {0,1,2,...,n}. The latter set has cardinality n+1, and hence so does 
S. 


In terms of counting, we have n + 1 choices for x (zero through n, inclusive) 
and then y is determined by y = n — x. So the number of solutions is equal to 


(7.138) ("T") i ("* ") 


We can also think of the question in the following tricky way. Imagine that 
we have n ping pong balls lined up in a row. To describe the solution x = 3 and 
y = n-—3 (assume n > 3), we put a ping pong paddle as a divider right after the 
third ping pong ball. So, in general, a solution (#,y) tox+y =n with x > 0 and 
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y => 0 corresponds to choosing one spot out of n+ 1 to put the ping pong paddle, 
or equivalently, n spots out of n +1 to put the ping pong balls. 


Example 7.41. How many triples of non-negative integers (x, y, z) that satisfy the 
equation 

(7.139) xtytz=7 

are there? 


We use the same idea as in the previous example. In this case, we can model 
this question as follows. Suppose that we have 7 Suppose that each 
Pokémon is one of three types: (1) [Xenal fans, (2) fans, and (3) fans. 
How many totals (x, y, z) are there in having x Xena fans, y Yoda fans, and z Zeus 
fans? Each total corresponds to putting two dividers between the 7 Pokémon. For 
the dividers and Pokémon combined, we have 9. The number of ways of inserting 


two dividers is &) 


For example, choosing the two dividers at the positions 5 and 7 corresponds to 
choosing « = 4 Xena fans (1, 2,3,4) and y = 1 Yoda fans (6) and z = 2 Zeus fans 
(8,9). See Figure [7.6.1 


Figure 7.6.1. 4 Xena fans, | Yoda fans, and 2 Zeus fans, separated by dividers. 


Lemma 7.42. Let n and k be positive integers. The number of non-negative integer 


k-tuples (11, %2,...,2%) satisfying the equation 
k 

(7.140) Soap = a tagt--+a,=n 
i=1 

is equal to 


(7.141) yet") = ae: 


Proof. The proof we give involves looking at the result in a clever way: 

Imagine we have a row of n ones, which of course sum up to n. Add to this 
row of ones k — 1 dividers, which we can denote by vertical lines. Then, in total, 
we have a row of n+’ — 1 ones and dividers altogether. 

Claim. Sequences of n ones and /—1 dividers are in one-to-one correspondence 
with non-negative integer solutions (x1, 22,...,2,%) to (7140). 

To see the claim, we describe a bijection and its inverse between the set of 
sequences and the set of solutions. 

Given a solution (11, 22,...,%) as above, we put the first divider in the 7; +1 
spot. For example, if x; = 0, then we put the first divider in the first spot. 
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We then put the second divider in the 7; + 22 + 2 spot. In this way we will 
have x1 ones, the first divider, x2 ones, the second divider, in that order. 


Continuing this way, we see that we should put the (k — 1)-st divider in the 
ty +%o+---+2~_-1 +k—1 spot. We have defined a sequence of n ones and k — 1 
dividers. 


Why k& — 1 dividers? Think of & — 1 walls separating k people in a row. For 
example, 1 wall separates 2 people. 


We now define a function which is the inverse of the function described above. 


Suppose that we are given a sequence of n ones and k — 1 dividers. Let the 
spots of the k — 1 dividers be 


(7.142) Yr < Y2< 01+ < Yr-1- 
We define 


ri=yi-i, 


LQ i= yo - 21 — 2, 


and in general 


(7.143) X= Yi Ly x2 gece Ui-1 i for 1 < 1 < k- 1, 
and 
(7.144) Lh =N-— £1 —LQ—+++— Le. 


Then the function that this describes is the inverse of the previously described 
function. We leave this fact as an exercise. This completes the proof of the claim. 


Now the claim implies that the set of sequences of ones and dividers and the 
set of non-negative integer solutions have the same cardinality. For the former set, 
each sequence corresponds to a choice of an n-element subset of N,+,-1 to put 

_ (n+tk-1 ; 
the ones. The number of such subsets is ( ) Equivalently (or should we 
n 
say complementarily?), each sequence corresponds to a choice of a (k — 1)-element 
n+k— ‘) 


subset of N,+,%—1 to put the dividers. The number of such subsets is ( k—-1 


Of course, by Lemma[7.21]the two binomials we arrived at are equal. 


5 

Example 7.43. For k = 3 and n = 3 the answer is (3) = 10. The list of 3-tuples 
summing to 3, in “increasing order”, is 

(0, 0,3), (0, 1, 2), (0, 2, 1), (0, 3,0), (1,0, 2), 

(1,1, 1), (1,2,0), (2,0,1), (2,1,0), (3,0, 0) 

If we choose to think in terms of rows of ones and dividers, as in the proof of 

Lemma [7.42] then the list (in the same order) looks like this: 

att, 411, faa]a, faa, 1], 

1/Ria, Dia. a) la Tae. Sn] |: 
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7.7*. Using the idea of a bijection 


We end this chapter with a fun application of using bijections of finite sets to find 
a strategy to win a game. 


7.7.1. Who can sum to 15? 


7.7.1.1. The rules of the game. Consider the following game you can play with 
your friend. The two of you take turns choosing from the integers 1 through 9, 
where once an integer is chosen, it is no longer available. The goal is to have three 
integers among all of the integers you have chosen to sum to the number 15 The 
game stops once one of you has three integers summing to 15 or when there are no 
longer any of the nine integers left. 


123456789 


Figure 7.7.1. The integers 1 through 9. 


7.7.1.2. A sample game. For example, your friend starts by choosing the num- 
ber 2. You then choose 5. Your friend chooses 8. You choose 6. The current 
situation looks like this: 


Your friend’s integers: 2, 8. 
Your integers: 5, 6. 
Integers remaining: 1, 3, 4, 7, 9. 


It is your friend’s turn. What should your friend do? 


Your friend calculates 15 — 2— 8 = 5 and wishes they could choose the number 
5 to win. But, alas, the 5 is safely in your possession. 


Next, your friend looks at the situation from your point of view and calculates 
15 —5 — 6 = 4, and they realize that you will win if you are able to choose the 4 
next. 


So your friend eagerly snatches up the 4 for their next choice. So now your 
friend has 2, 4, 8. They must like powers of 2! 


Now it is your turn and you must decide what integer to choose among the 
remaining 1, 3, 7, 9. You know you cannot win on this choice as the 4 is gone. 
What do you do? 


You decide that you are going to block your friend from summing to 15 on 
their next choice. Firstly, since your friend has 2 and 4, if they get 9, you are 
lost. Secondly, since your friend also has 4 and 8, if they get 3, you are also lost. 
Unfortunately, both 3 and 9 are available and you cannot choose both of them at 
once. So no matter what integer you choose, you will lose if your friend makes the 
correct subsequent choice! 


7.7.1.3. What are the ways to win? This is easy to see and we list the ways 
below in lexicographical order. There are eight ways to choose single-digit integers 
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to sum to 15: 
1,5, 9 
1, 6,8 
2, 4,9 
2,5, 8 
2, 6, 7 
3, 4,8 
3, 5, 7 
4, 5,6 


7.7.1.4. How do we win the game, or can we win the game? One way of analyzing 
the game is to place the nine integers 1 through 9 on a/tic-tac-toe board as in Figure 
Observe that there are 3 rows, 3 columns, and 2 diagonals (totalling 8) and 
that if you sum the 3 integers in any row or in any column or in any diagonal, 
the result is 15. So winning at tic-tac-toe corresponds exactly to summing three 
integers to 15. 


Now, with correct play, tic-tac-toe always ends in a draw. We will leave it to 
you to do a complete analysis of tic-tac-toe and verify this fact. For example, there 
are three types of first moves: 


(1) The first player puts an “X” in the middle square. Then the second player 
has to put an “O” in a corner square, or they lose. Etc. 


(2) The first player puts an “X” in a corner square. Then the second player has 
to put an “O” in the middle square, or they lose. Etc. 


(3) The first player puts an “X” in a side square. Then the second player has to 
put an “O” in either the middle square, the opposite side square, an adjacent 
corner square, or they lose. Etc. 


Table 7.7.1. The integers from 1 to 9 on a tic-tac-toe board (3 x 3 grid). 


2|7)6 
9;5]1 
4)3/8 


Exercise 7.25. Prove that with correct play, tic-tac-toe always ends in a draw. 
With correct play, what is the outcome of the game of summing to 15? 


7.8. Hints and partial solutions for the exercises 
Hint for Exercise Suppose that X and Y are disjoint. Use the addition 
principle to obtain a contradiction. 


Hint for Exercise [7.2] Firstly, show that the function f is well-defined; that is, 
if (v, y) is in the domain, then f(x, y) is in the codomain. 
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It is rather easy to see that f is a bijection because it is essentially the identity 
function. 


Apply the addition principle for n sets. 
Hint for Exercise We can compute that 
XUY =(AnY)JUANY))u(¥ox)uYnx*)) 
=(XNY)JU(X-Y)JU(XNY)U(Y—-X) 
=(X —Y)U(Y—-X)U(XNY). 
Show that the pairwise intersections of the three sets on the right-hand side are all 
empty. 
Hint for Exercise [7.4] Let A, B, C denote the sets of dinosaurs that eat apples, 
bananas, and cherries, respectively. 
By (1), a:=|A| = 125, b:= |B] = 131, and c:= |C| = 136. 
By (2), ab:=|AN B| = 81, ac:=|ANC| = 84, and bc := |ANC| = 88. 
By (3), abc :=|ANBNC| =41. 
By the inclusion-exclusion principle, we have that the set of dinosaurs which 
eat at least one of the three fruits has cardinality 


|AUBUC|=|A]4+ |B] +|C|—|AN B)-|ANC|—|BNC|+|ANBNC|. 
Continue this calculation. 


Hint for Exercise The way we solved the previous exercise solves this 
exercise. 


Hint for Exercise The following diagram gives the counts: 


Hint for Exercise We prove the n = 3 case. Let Y; = X,M X3 and 
Yo = X2M X3. By the inclusion-exclusion principle for 2 sets, 
(7.145) |X, U XqU X3| = |Xy U Xe] + |X3| —|¥1 UY9| 


since (X; U X2)M X3 = Y; UYo. The result follows by applying the inclusion- 
exclusion principle for 2 sets to both |X, U X9| and |Y; U Y3|, while observing that 
YiN Yo = X10 X2N X3. 
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Hint for Exercise (1) The empty set @ is a subset of any set. 
(2) Suppose A C B. Let C € P(A). Show that C € P(B). 
(3) Use that ANBCAand ANBCB. 
Hint for Exercise The answer is that we must have (A, B) = (E,O). Use 


that AUB =Z and B C Oto prove that EC A. Similarly, prove that O C B. See 
Figure [7.8.1 


O 


E 


Z, 


Figure 7.8.1. This Venn diagram shows that Z is the disjoint union of F and 
O. Using that Z is also the disjoint union of A and B, we deduce that A = E 
and B = O; that is, the red shaded parts contain no elements. 


Hint for Exercise Define the “complement function” c: X — Y by 
(7.146) c(A) = A®. 


Use the addition principle to show that this function is well-defined. 


Show that the complement function c’ : Y > X defined by c’(B) = B® is the 
inverse of the function c. 


Hint for Exercise [7.11] We define 

Hf: Sout > Pr(Naci) 
by H(C) := C, and we define 

EK: Py(Nn-1) 4 Sout 


by K(D) := D. It is easy to see that both functions H and K are well-defined and 
that they are inverses of each other. 


Hint for Exercise[?.12] (1) Show that f(C) € P(X)xP(Y). Denote? := |CNX], 
so that |CN X| € P;(X). Show that |CN Y| = k — 1, so that C € Py_i(Y). This 
implies that the function f is well-defined. 

Let (A,B) € es Pi(X) <x Pp_i(Y). Then there exists 1 <i < k such that 
A€P;(X) and B € P,_;(Y). Show that g(A, B) C XUY and |g(A, B)| =k. This 
implies that the function g is well-defined. 

(2) Show that g(f(C)) = C and that f(g(A, B)) = (A, B), using that X and 
Y are disjoint. 
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(3) Since |X| = m, |Y| =n, and |X UY| = m+n, we may start a calculation 
by 


("yr") = Peru =| 


Continue. 


Hint for Exercise Ifn <0,r <0, or r > n, then each term in (7.45) is 
zero. We also check that (7.45) holds for r = 0. 


Hint for Exercise One starts the calculation by 


fn+3 = Frat a fn+2 


22) 


j+k=n jtk=n4+1 


= © (G)+G4)): 


i 


and one uses the previous exercise. 
Hint for Exercise [7.15} Use Theorem [7.24] 
Hint for Exercise Use Theorem [7.24] and properties about division by 


primes. 
Hint for Exercise Use the previous exercise. 
Hint for Exercise Use the Binomial Theorem. 


Hint for Exercise Use part (1) for the inductive step in part (2). 


k+é 
Hint for Exercise [7.20] By the Binomial Theorem, we have that ( ig ) is the 
m 


coefficient of x™ in the binomial expansion of (1+ 2)*t*, for0<m<k+é. On 
the other hand, by the Binomial Theorem, we have that 


(l+a)*¥.(1+2) ay @EDs ({) =" 


k £ 
(7.147) dd ai 5). 
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We claim that we may rewrite this as the sum 


(7.148) dd ali,m i), 


To see this, let Nj, := {0,1,2,...,k} for k € Z2. Then 


k L 
(7.149) SoS ali, ye aig). 


4=0 7=0 (4,9)E Nn x Ne 


Here, the sum over a finite set (such as N;, x Ne) is well-defined since addition is 
commutative. We can partition the (index) set Nx x Ne as 


k+e 

where 
(7.151) Im := {(t,9) © Ne x Nes i +j =m}. 
Thus, 

k+e 

~~ @&D= Yo Vows 
(i,j)ENnxNe m=0iEIm 
k+é m 
= (>. a(i,m — ») 
m=0 \i=0 


since (i,7) € I, means that 7 = m— i, where 0 <i <m. This proves the claim. 
Now, applying the claim yields 


7.8. Hints and partial solutions for the exercises 359 


where for the last equality we used that 


m—t 


(7.152) ({) =o for i> bs ( t ) =o for i<m—< 
i 


By equating the coefficients of x”, this proves formula (7.72). 
Hint for Exercise (1) We compute that 


B(C(y1, Y2,---,Yk)) = B(g), 
where the function g is defined by g(x;) = y; for 1 <i < k. Show that 
B(C(y1, ya, +++ Yk) = (Ys Yas +++ Yk): 
(2) Show that 


where h is the function defined by 

(7.153) h(xi) = f(z). 

Hint for Exercise We show that ®o W is the identity and leave it to you 
to show that Wo ® is the identity. We calculate that 

(7.154) ®(U(A)) = (v4) = XG (1) = A. 

Hint for Exercise [7.23] 


Inductive step: Suppose, as our inductive hypothesis, that m € Z* has 
the property that the number of injections from N,, to N, is equal to 
n(n — 1)(n — 2)---(n —m +1) for all positive integers n. 

Firstly, show that 

n 
(7.155) Inj(Nm+i,Nn) = LJ ij 
j=l 
is a disjoint union. 


Next, show that the cardinality of each set I; is equal to the cardinality of the 
set Inj(Nin, Nn_1). 
Let 1 <j <n and let f € J;. Since f is an injection, we have that 


(7.156) fin, ? Nm 7 Nn -— {9} = Nn-i 


is a well-defined injection. Moreover, it is not difficult to see that the function 
defined by f ++ f|n,, is a bijection from I; to Inj(Nm,Nn — {j}). Show that 


[Jj] = (n — 1)(n — 2)(n — 8)--- (n—m). 
Finally, show that 


| Inj(Nm4i,Nn)| = 5° 5 
j=l 


= n(n — 1)(n—2)---(n-—(m+1) +1). 
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Hint for Exercise See Table [7.8.1 


Table 7.8.1. There are fourteen surjections from N4 to No. 


ipeax’ 
gy, 
< 
Ss 
te 
& 


Ss 
NNN PENN NNR HID 
— 


ty 
Doe N NK eee ee lS 

i 
NOrNMOrRNFRFNNF RF NN FE 
PNmPNNFRFNFNF MH NY-E LD 


Hint for Exercise With correct play, the game of summing to 15 always 
ends in a draw. 


Chapter 8 


Congruence Class Arithmetic, 
Groups, and Fields 


|} | 
10-9 -—8 -7 -6 -5 -4-3-2-10 12 3 4 5 6 7 8 9 10 


Figure 8.0.1. The congruence class of red numbers is [0]3, the congruence 
class of blue numbers is [1]3, and the congruence class of green numbers is [2]3. 
In totality, we get the Notorious R.B.G.; ie., the union is RUBUG=Z. 


Goals of this chapter: The arithmetic of congruence classes. Some abstract 
algebra: groups, rings, and fields. The Fundamental Theorem of Algebra. The law 
of quadratic reciprocity. 


8.1. Congruence classes modulo m 


The idea of congruence classes is to put in the same basket integers that are con- 
gruent. It’s marvelous that we can do arithmetic with congruence classes. 


8.1.1. The definition of congruence class. Consider looking at integers from 
the point of view of dividing by 3. We know by the Division Theorem that there 
are 3 possible remainders: 0, 1, and 2. Two integers are congruent modulo 3 if 
and only if they have the same remainder when divided by 3. What if we put all 
of the integers that are congruent to each other in the same basket? How many 
baskets would we have? For example, all of the integers that have remainder 0 are 
congruent to each other, and these are the integers in one of the baskets, which we 
can color red. A second basket consists of the integers with remainder 1, which we 
can color blue. The third, and final, basket consists of the integers with remainder 
2, which we can color green. By the Division Theorem, any integer is in one of 
these three baskets, and for each basket the integers in it are all congruent to each 
other. Thus, we have 3 baskets, each of which is a subset of Z. We denote the red, 
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blue, green baskets by [0]3, [1]3, [2]3, respectively. Each of these sets is a congruence 
class, which we now define for every modulus. See Figure |8.0.1 


Definition 8.1. Let m be a positive integer. Given a € Z, its congruence class) 


modulo m, written as [a],,,, is the set of all integers congruent to a modulo m; that 
is, 
(8.1) [a], ={x € Z|x=a mod m}. 

That is, we group together all of the integers congruent to a fixed integer to 


form a set, called the congruence class of that integer. For example, the congruence 
class of —5 (modulo 3) is 


[—5|3 ={x# €Z:x2=-—5 mod 3} 
={-5+3k:k eZ} 
ee es eck eee ee ail ee 


Each congruence class is an infinite set. But we think of it as a single object! 


Solved Problem 8.2. Show that [—5]3 = [l]3. That is, the subset [—5]3 of Z is 
equal to, as a set, the subset [1]3 of Z. 


Solution. Using —5 = mod3 and the transitivity of congruence (modulo 3), 
we compute that 
[—5]3 = {x €Z: x4 =-—5 mod 3} 
={xe€Z:x=1mod3} 
= [I]Js. 
In Figure [8.0.1] each of the sets [—5]3 and [1]3 is the set of blue integers. 


We start off with some simple properties. 
Theorem 8.3. Let m be a positive integer. 
(1) For any integer a, we have a € {a)m.- 
(2) x € [alm if and only if m divides x — a. 
3) The congruence class of a is equal to 


( 
(8.2) [a], ={atkm:keZ}. 


Proof. (1) By the reflexive property of congruence, we have that a = a mod m. 
Therefore a € [a]m. 


(2) We have that 


xélaqm <— x=amodm 


& m divides x — a. 


(3) Let x € [a]. This is equivalent to x = a mod m. This, in turn, is equivalent 
to m divides x — a. This is also equivalent to the fact that there exists k € Z such 
that « —a = km; that is, c =a+km. This says that x € {a+km:keZ}. In 
conclusion, we have shown that x € [a] is equivalent to x € {a+ km:k € Z}. 
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Example 8.4 (m = 2). Since we are familiar with even and odd integers, we first 
consider the case where m = 2. 


(1) [0], is the set of even integers. Indeed, x = 0 mod 2 if and only if « = 2k 
for some integer k; that is, x is even. Of course, 0 € [O]o. 


(2) [1], is the set of odd integers. Indeed, x = 1 mod 2 if and only if «—1 = 2k, 
ie., e = 2k +1, for some integer k. The result follows from the characterization 


of odd integers. 
Observe the following: 
(3) [a]2 = [0]2 if and only if a is an even integer. 
(4) [a]o = [2 if and only if a is an odd integer. 


In words, the congruence class of an integer is equal to the congruence class of zero 
if and only if the integer is even. The congruence class of an integer is equal to the 
congruence class of one if and only if the integer is odd. 


Proof of (3). (=) Suppose that [aJz2 = [0]2. Since, by Theorem [8.3{1), 
a € [a]2, we thus have that a € [0]2, which implies that a is even. 

(<=) Suppose that a is an even integer. Then there exists an integer k such 
that a = 2k. Let x € [a]2. This is equivalent to x = a+ 2p = 2(k + p) for some 
integer p, which is equivalent to « being an even integer; that is, x € [O]2. Here we 
used part (1). We have proved [a]2 = [O]2. O 


Exercise 8.1. Prove Example [8.4{4); that is, [alz = [1]2 if and only if a is an odd 
integer. 


We can visualize the example above as follows. Let us choose to make even 
numbers red and to make odd numbers blue. Then, for every even integer a, we 
have that [a]z is the set of red integers. Similarly, for every odd integer a, we have 
that [a]2 is the set of blue integers. 


(8.3) fete S = 8) 104,90 94,.,.1. 


For any positive modulus m, we can perform a similar visualization with m colors. 
For example, Figure[8.0.1]shows for m = 3 the following color coding: 


(8.4) ee a ey ee ie ee eee 


8.1.2. Fundamental properties of congruence classes. Firstly, we understand 
congruence classes in terms of congruence. To get going, observe the following 
simple result. 


Lemma 8.5. The congruence class of an integer in a congruence class is equal to 
that congruence class. In fact, we have the characterization 


(8.5) ae [lin = [a] = [Bin ‘a 


Proof. (<) follows from a € [a]m. 


(=) Suppose a € [}],,. Then there exists an integer j such that a = b+ jm. 
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Thus 


(8.6) [a], ={atkm: ke Z}={b+(G+k)m:k eZ} ={b+lm:leZ} = [b,,. 


For example, 
aé[-2, @ [als =[-2y. 


The equivalent conditions in the display are each equivalent to a = 1 mod 3. 
Recall from that the set of remainders modulo m is 


(8.7) Rm = {0,1,2,3,...,m-— 1}, 
and we have the remainder function r : Z > R,,, defined by 
(8.8) r(a) =r, 


where r is the unique remainder given by the Division Theorem. 


Theorem 8.6. 
(1) Two integers having equal congruence classes is equivalent to them being con- 
gruent: 
(8.9) a=bmodm ss (a, =[),, ©  [al,, [0,4 9- 


Recall that two integers are congruent if and only if they have the same re- 
mainder. 


(2) Equivalently, two integers having unequal congruence classes is equivalent to 
them being incongruent: 


a#bmodm # (a, 4b, © [a], 9[,, = 9- 


m 


Two integers are incongruent if and only if they have different remainders. 
(3) We have 
(8.10) [@]m = [2(@)]m 
fora €Z, and the union 
(8.11) Z = (0| U [2],,U°::U[m— 1], 


is a disjoint union. That is, the collection of congruence classes partitions the 
set of integers. See below for the general definition of partition. 


Proof. (1) Let a and 6 be integers. Then 


a=bmodm Ss m divides a—b 
= aé(b],, 
[dy Mn #9. 
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Moreover, 
[a], [8], 79 = 3k,€eZ,at+km=b+lm 
=> dk,leZ,a=b+(l—k)m 
=> aé(b],, 


= [alm = [n> 

where the last <= follows from Lemma 

Part (2) is evidently equivalent to part (1) since any biconditional P = Q is 
equivalent to the biconditional not P © not Q. 

(3) Let a € Z. Then a = r(a) mod m by (6.13). Hence, part (1) implies that 
a € [alm = [r(a@)]m. Since r(a) € Zp, this proves 
(8.12) Z = [0]. UY Bn U Zl, U-:: Ulm — Ln: 
Now, if b,c € Zm satisfy b 4 c, then b # cmod m by Lemma [6.4] which implies 
that [b]m [clam = by part (2). This shows that the union in (8.12) is a disjoint 
union. 


m m 


Example 8.7 (m = 3). Revisiting the mod 3 discussion at the beginning of this 
chapter, we have by Theorem 8.6{1) that 


[alg = [b]l3 = a=bmod3 © a, b have the same remainder after dividing by 3. 
So, R := [0]3 is the set of all multiples of 3, B := [1]3 is the set of all multiples of 
3 plus 1, and G := [2]s is the set of all multiples of 3 plus 2. In other words, the 
red integers have remainder 0, the blue integers have remainder 1, and the green 


integers have remainder 2. See Figure for a visualization of these congruence 
classes. We have, for example, 


16 € [l]s, —7 € [2]s. 
We also have that Z = RU BUG is a disjoint union (as in (8.11)). 


[2]; <-++ t+—+ +—+—} +++} —+ + +—+ t— 
10-9 -8 -7 -6 -5 -4-3-2-10 12 3 45 6 7 8 9 10 

[1]3 <¢ + + +-—* +-—* + + +> 
10-9 -8 -7 -6 -5 -4-3-2-10 12 3 4 5 6 7 8 9 10 

[O]3 <¢ + +] I al & od +—1-> 
10-9 -8 -7 -6 -5 -4-3-2-10 12 3 4 5 6 7 8 9 10 


Figure 8.1.1. Visualizing Notorious R.B.G. as a partition of Z. Compare 
with Figure [8.0.1 
Exercise 8.2. Let S be the set of all integers n with the property that 
(8.13) Z = [n]z3 U [16]3 U [—16]s. 
Find this set S. 
Exercise 8.3. Find a positive integer m with the property that 
(8.14) [03 NE = [0|m, 


where E is the set of even integers. Prove your answer. 


366 8. Congruence Class Arithmetic, Groups, and Fields 


8.1.3. The set of congruence classes. By Theorem [8.6[3), the set of con- 
gruence classes is 


(8.15) Zn = {[O]p, > n> Zin o-- lm — 11,,}- 


This set has m elements. Each element of Z,,, is in and of itself a subset of Z. 


Example 8.8. Let m be a positive integer. The set 
[4], ={--.,4—2m,4—m,4,4+m,4+ 2m,...} 


itself comprises just one element of Z,,,. However, as a set, [4], C Z has infinitely 
many elements. 


Observe that there is a (natural) bijection between the set of congruence classes 
Zm and the set of remainders Ry». 


Lemma 8.9. The function f : Rm — Zm, defined by 
(8.16) f(r) =[r]m for 0<r<m, 
is a bijection. 

Exercise 8.4. Prove Lemma[.9} 

Exercise 8.5. Prove that for every integer b, 


(8.17) Zz = {[bs, [b+ 1]s, [b + 2]s}. 


Can you generalize this mod 3 statement to a mod m statement for every pos- 
itive integer m? 


8.1.4. Addition and multiplication of congruence classes. Let m be a positive 
integer. It makes sense to define algebraic operations on the set Z,, of congruence 
classes modulo m as follows. 


Theorem 8.10. The operations +, —, and: on Zp defined by 
+ 


(8.18a) [a]. + (ol, = [@+4l,,» 
(8.18b) [Ahn — [Din a [a ~~ Blin ’ 
(8.18c) [a] * [lm = [2 lm 


are well-defined. So we can add, subtract, and multiply congruence classes. 


Proof. So what is the issue? A congruence class is a set, and there are an infinite 
number of ways to write a congruence class. Namely, consider the congruence 
class [a] of an integer a. This congruence class is equal to the congruence class 
[a+ mk]m for each integer k. Since there are an infinite number of integers, there 
are therefore an infinite number of ways to write the congruence class [a]. So, 
when we define congruence class addition by (8.18al, we have to make sure that the 
definition makes sense, that is, is independent of the way we write the congruence 
class. 
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Suppose that [a]m = [a’Jm and [b]m = [b']m. We then have that a’ = a+mk 
and b’ = b+ mé for some integers k and &. 


(a) Thus, firstly, we have 
[a’ +m = [(a + mk) + (b+ mé)]m 
=[a+b+m(k+2)|m 
= [a+ dm, 


where we used Theorem [8.6{1) to obtain the last equality. This proves that the 
addition of congruence classes is well-defined. 


(b) Secondly, we leave the proof that the subtraction of congruence classes is 
well-defined as an exercise. 


(c) Thirdly, we have 
[a’ - B'|m = [(a + mk) - (b+ me)]m 
= [a-b+ m(al + bk + mke)|m 
= [a-d]m. 


This proves that the multiplication of congruence classes is well-defined. 


[M]m 


Figure 8.1.2. The modulo m congruence classes George and Martha. 


To recap the idea of adding congruence classes, suppose that George and 
Martha are congruence classed} i.e., George, Martha € Z,,. How do we add them 
to get a congruence class George + Martha € Z,,? Since George, Martha € Z, 
there exist integers G and M such that 


(8.19) George =[G]m and Martha = [M]m. 
We define 
(8.20) George + Martha = [G+ M]m. 


Here is the catch. Given George € Z,,, there is not a unique integer G such that 
George = [G]m. For example, George = [G+ m]m = [G+ 2m],, also. In fact, 
there are an infinite number of integers that represent George. The same goes for 
Martha. So, for addition of congruence classes to be well-defined, the question we 
need to answer is this: 


If G and G’ both represent George and if M and M’ both represent Martha, 
do G+ M and G’ + M’ both represent the same congruence class? If so, then we 
can define George + Martha to be this congruence class. 


The affirmative answer is provided by Theorem [8.10] 


1In the series of children’s books by James Marshall,/George and Martha|are hippopotami. 


368 8. Congruence Class Arithmetic, Groups, and Fields 


Using the Notorious (red, blue, green) color coding, the addition and 
multiplication tables for Z3 are 


Qw w|t 
Qwa|a 
DAwW|w 
Wm QIQ 
Qwa|o 
wWaraA 


The addition and multiplication tables for Zs; are given by Figure ??. Observe 
that the multiplication tables for (Rs, x5) in Example|6.10]and for (Z;,-) in Figure 
?? are equivalent. 


+ O]s [1Js5 [2]5 [3]5 [Als 
Os | (Ols [ls [2ls [8ls_ [Als 
Js} (Js [2]5 [3]5 [4]5 [0s 
2)5 | (2)5 [8]5 [Als [O]s [Ls 
3)5 | (3]5 [4]5 [0]5 [Js [2]5 
4)5 | [4ls [O]s [Js [2]5 [3]s 

Os [U5 [2)5 [3]5  [4]s 
Ojs | [O]s  [OJs [O]5  [O]s  [O]s 
15 | (OJ5 [15 [2]5 [38]5  [4]s 
2/5 | (O]s [2)5 [4J5 [Js [3]s 
35 | [Ols [8ls [ls [41s [2I5 
Als | [O]5 [4]5 [3]5 [2]5  [1]5 


Figure 8.1.3. Addition and multiplication tables for Zs. 


Figure [8.1.4] visualizes a multiplication table. What is the modulus? 


Exercise 8.6. Prove: If a,b € Z are such that [a]m = [blm, then gcd(a,m) = 
gcd(b, m). 

In particular, if a,b € Z are such that [alm = [blm, then gced(a,m) = 1 if and 
only if gcd(b,m) = 1. 


Exercise 8.7. Prove that [a|m-[6]m = [1]m implies that gcd(a,m) = gcd(b,m) = 1. 
Hint: Apply Theorem [6.16] 


Exercise 8.8. Find all pairs ([a]24, [blas) such that 
(8.21) [aloa : [Dloa = [O]o4 and O0<a<b< 24. 


Hint: There are 27 pairs! 
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Figure 8.1.4. Explain why this colorful diagram represents a multiplication 
table rather than a © 
8.2. Inverses of congruence classes 


Recall from Definition [6.15] that integers a and b are inverses of each other modulo 
m if and only if 


(8.22) ab = 1 mod m. 


In this case, by definition of the multiplication of congruence classes, we have 


[2] yn . [Blin = Ee 
This motivates: 
Definition 8.11. If [a],,, - [b],,, = [1],,, then we say that [a],, is invertible and 
that [0],,, is the inverse of [al,,,. 


Example 8.12. (1) [1]7 is its own inverse. 
(2) [2]7 and [4]7 are inverses of each other since 2-4=8=7+1. 
(3) [lz 
(4) [6]7 is its own inverse since 6-6 = 36 =5-7+ 1. (It is easier to see this from 
(—1)(-1) = 1; why?) 


Example 8.13. [3];2 does not have an inverse. Neither does [6],4. 


|7 and [5]7 are inverses of each other since 3-5 =15=2-7+1. 


Exercise 8.9. Show that [a]m has an inverse if and only if gcd(a,m) = 1. Hint: 
Apply Theorem See also Exercise 


Lemma 8.14. [f a congruence class [a]m has an inverse, then its inverse is unique. 


Proof. Suppose that [6], and [c],, are both inverses of [a]. Then ab = 1 mod m 
and ac = 1 mod m. This implies that a(b — c) = 0 mod m; that is, 


m|a(b—c). 
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On the other hand, ab = 1 mod m implies that gcd(a,m) = 1 (exercise). Thus, by 
Theorem |4.23) we have that 


m|(b—c). 


This proves that [6], = [c]m. 


Remark 8.15. Assuming that you have solved it already, Exercise [6.17] gives a 
quicker proof of the lemma. Conversely, the proof of the lemma gives a solution to 


Exercise [6.17] 


Exercise 8.10. (1) For each congruence class [a]g (mod 8) that has an inverse, 
find its inverse. 


(2) For each congruence class [alg (mod 9) that has an inverse, find its inverse. 


8.2.1. Solving linear congruences by finding an inverse. We find it convenient 
to denote the inverse of [a]; by [a~]m, as long as we don’t confuse a~! with the 


usual multiplicative inverse (which is not an integer unless a = £1). 
Suppose that [a], has an inverse [a~+],,. Then aa~' = 1 mod m. Hence, 
z=a'b 


is a solution to the linear congruence equation 
(8.23) ax =b mod m. 
Indeed, we have that 


ax = aa~'b = bmod m. 
This solution is unique modulo m (see below). So the set of solutions to (8.23) is 
(8.24) [a~*blm = {a b+ km: k € Z}. 
Now we show that the solution to (8.23) is unique modulo m: If az = b mod 


m and ay = b mod m, then a(a — y) = 0 mod m. Since gcd(a,m) = 1, we conclude 
that x — y =0 mod m. 


Remark 8.16. Note that what this means for congruence classes is: If a and m 
are coprime, then for every b € Z there exists a unique solution [z],,, € Zm to 


Solved Problem 8.17. Prove that [67|131 is invertible, and find its inverse. Use 
this to solve the linear congruence 


67x = 3 mod 131. 


Solution. By the Euclidean algorithm, 


131 = 67-1+ 64, 
67 = 64-143, 
64=3-21+1, 


3=1-3+40. 
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Hence gcd(67, 131) = 1. By reversing the Euclidean algorithm, we get 
1=64-—3-21 
= (—21)67 + (22)64 
= (22)131 + (—43)67. 
Therefore, [—43]i31 = [88]i31 is the unique inverse of [67]13:. This implies that 


xz = —43 is a solution to 67z = 1 mod 1381, so that « = —129 is a solution to 
67x = 3 mod 131. Thus, the set of all solutions to 67x = 3 mod 131 is 


[—129]i31 = {-129 + 181k: k © Z} = {24 131k: k € Z} = [2]i31. 
Indeed, we check the particular solution 
(8.25) 67-2 = 1384 =3 mod 131. 


Since gcd(67,131) = 1, we confirm that the general solution differs from 2 by a 
multiple of 131. 


Exercise 8.11. For each 0 <a< 7, use inverses to find the general solution to 
(8.26) ax = 3 mod 7. 


If you prefer, you may state your answer by giving for each0 <a< 7 the set of all 
solutions to (8.26) as a congruence class. 


Solved Problem 8.18. Suppose that [alm -[blm = [clm. Prove that gcd(a,m) 
divides c. 

Solution. By the hypothesis, there exists an integer q such that 
(8.27) ab—c=mq; that is, ab—mq=c. 


This implies that gcd(a,m) divides c. Observe that we also have that gcd(b,m) 
divides c. 


8.2.2. The set of invertible congruence classes. We define the set of invertible 
congruence classes modulo m by 


(8.28) Zr, := {[t]m © Zm : [z]m is invertible}. 


By Lemma [8.14] the inverse of each element Z*, is unique. Given [a], € Z*,, we 
denote its unique inverse by [a~']m. By Exercise [8.9] [a]m is invertible if and only 
if gcd(a,m) = 1. Recall from that R*, := {x € Ry : gcd(x,m) = 1}. We 
can now prove the following. 


Lemma 8.19. The function ¢: R*, > Z*, defined by $(x) = |2|m 1s @ bijection. 


Proof. Firstly, we observe that if « € Rt, then [x], € Z*, since ged(x,m) = 1 
implies that [z],, is invertible. Hence, ¢ is a well-defined function. 


Secondly, it is easy to see that ¢ is a bijection. For suppose z,y € Rm* 
satisfy ¢(x) = o(y). Then [t]m = [y]m, which implies x = ymod™m. Since x,y € 
Rm* C Rm, this implies that « = y. Hence, ¢ is an injection. To see that ¢@ is a 
surjection, suppose that [zt], € Z*,. Then [t]m = [r(#)]m and r(x) € Rp». Since 
[Z]m € Z*,, we have gcd(x,m) = 1, so that gcd(r(a),m) = 1, which in turn implies 
that r(z) € R*,. We now conclude that $(r(x)) = [r(@)]m = [t]m. This proves that 
@ is a surjection. 
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Example 8.20. If p is a prime, then 


(8.29) Zy = {[1],2lp,---+ lp — Up}- 


Solved Problem 8.21. Write out the multiplication table for (Z5,-), where mul- 
tiplication - is defined by (8.18c). 


Solution. We have 
(8.30) Zo = {[1]o, [2]o, [4]o, [5]o, [Z]o, [8]o}. 
The multiplication table is 


l]9 [2}9 |4lo [5]o [Z]9  [8]o 

19 lJ9 [2]o |4Jo [5]9 [lo [8]o 

2)9 | [2]o [4]o [Slo [1]o [5]o [Vo 

(8.31) 4jo | [Alo [8]o [Z]o |2]o [19 [5]o 
5l9 Slo [1]g9 [2]9 [7]o [8lg [Alo 

lo | [719 [5]9 [1]o [8lo [419 [29 

Slo | [8lo [Zo [5]o |4lo [2J9 [Lo 


As we have implicitly assumed above for m = 9, multiplication - on Z,,, restricts 
to ZF: 


Exercise 8.12. In multiplication - is defined on Z,,. Prove that if [a]m, [b]m 
€ Z*,, then [a]m- [b]m € Zr. 


8.3. Reprise of the proof of Fermat’s Little Theorem 


We now reformulate the proof of Fermat’s Little Theorem [6.63] in the language of 
congruence classes. 


Let p be a prime and let a € Z* be such that a is not a multiple of p; ie., p 
does not divide a. 


Let us look at the set Z, of congruence classes modulo p. There are exactly p 
congruence classes and 


Ly = {[0], , [1], yore [p — 1p}. 


We define the function fa : Zp + Z, by 


(8.32) fa([a],) = [aa], = [a]p - [2]p- 


8.3.1. The idea by an example of the proof of Fermat’s Little Theorem. 
Example 8.22 (a = 3 and p=5). Here, we have that 


f3 :Z5 > Zs 
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is given by 
f3([0 5) a 0); . 
f(s) = Bs, 
fs([2]5) = [6]; = [1], 
f3([3 5) a 95 [4]5 » 
fa([4]5) = [12], = [2], 


So, in order, the function f3 maps the sequence [0], , [1]; , [2]; , [3], , [4], to the se- 
quence 


[0], ? [3]; ’ [1]; ? [4]; ? [2], . 


Observe that fs is a bijection from Zs to Zs (itself). Further observe that we can 
read off this sequence from the fourth row of the multiplication table in Figure 


[8.1.3] 
Recall that 3°-! = 34 = 81 = 1 mod 5. A robust way of proving this is the 
following congruence class calculation: 


[4!]; > 


1]; [2]; [3]; [4]; definition of multiplying congruence classes 
= [3]; [1]; [4]; [2], commutativity of multiplying congruence classes 
3 


[ 

[ 
|; [6]; [9]; [12], | observed equality of congruence classes using f3 
= [3-6 
= [34-1-2-3-4]; factoring out 3’s 


= [3*-4!], definition of factorial. 


5 
-9-12]; definition of multiplying congruence classes 


So, in terms of congruences, we have 
4! = 4!-3* mod 5. 


On the other hand, since 5 is prime, we have gcd(4!,5) = 1 by Corollary 2.18} 
Hence 
1=3' mod 5 


by Corollary 


8.3.2. Congruence class proof of Fermat’s Little Theorem. Now we show 
the derivation in the previous subsection that 3+ = 1 mod 5 is robust: 


Theorem 8.23 (Fermat’s Little Theorem redux). [f p is a prime and if a is a 
positive integer which is not a multiple of p, then a?~! = 1 mod p. 

Proof. (1) We claim that 

(8.33) fa : Zp + Zp is a bijection. 


Since the domain Z, has the same number of elements as the codomain Z,, it 
suffices to show that fa is an injection by Theorem 5.63] 


Proof that fa is an injection: Suppose fa([Z1]p) = fa([t2]p). Then 


ax, =ax2 mod p. 
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Since a is not a multiple of p, it follows from Proposition that gcd(a, p) = 1. 
Hence, by Corollary |6.21} we have 


1 =72 mod p. 
This implies [x1], = [v2]p. We have proved that f, is an injection. 
(2) Since fa is a bijection and since f,([0]p) = [0]p, we have that the set 
{[1],.[2],,---» le — I} is equal to the set {[la],, , [2a],,,...,[(p — 1)a],,}. Therefore 


we obtain the second equality in the following display (the first equality is by the 
definition of multiplication of congruence classes): 


[(e—1)!), =[,- 2I,--: lp - 1, 
= [la], - [2a], --- [(p — 1a], . 
Now 
[La], - [2a], --- [(p— 1a], = [(p—1)!- a? )p, 
so that 
(p—1)!=(p—1)!-a?~! mod p. 


On the other hand, since p is prime, we have gced((p—1)!, p) = 1 by Corollary 2.18} 
Hence 


1=a?! mod p. 


This completes the congruence class proof of Fermat’s Little Theorem. 


8.4. Equivalence relations, equivalence classes, and partitions 
In this section we consider the general notion of an equivalence relation. This notion 
is important in many areas of mathematics. 

8.4.1. Relations. 


Definition 8.24. A relation on a set X can be defined as a subset R of the 
cartesian product set X x X. We then define a to be related to b, written as 


(8.34) a~b, 
if (a,b) € R. 


This is a super general definition as we have not imposed any conditions on R. 


Example 8.25. Let X = Z. Let m be a positive integer. Define the relation 
RCZ~x Z to be the subset 


(8.35) R:={(a,b)€Zx Z:a=bmodm}. 


So a ~ 0 if and only if a = b mod ™m. In this sense, congruence (modulo m) is a 
relation. Furthermore, recall that congruence has the properties of being reflexive, 
symmetric, and transitive. 
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8.4.2. Equivalence relations. More generally, we have the following: 
Definition 8.26. Let X be a set. We say that a relation RC X x X is: 
(1) Reflexive if: Vr e X,a~ az. 
(2) Symmetric if: Vz,ye X,a2~y, then y~ a. 
(3) Transitive if: Vz,y,z€ X,v~y and y~ z, then a ~ z. 


Definition 8.27. We say that a relation is an |equivalence relation if it is re- 


flexive, symmetric, and transitive. 


So, of course: 


Example 8.28 (Congruence is an equivalence relation). Given a positive integer 
m, congruence modulo m is an equivalence relation on Z. 


On the other hand, here is a pitfall. 


Non-Theorem 8.29. For any|relation|~ on a set X, the{symmetry property im- 
plies the reflexivity property. 


Non-Proof. Assume the symmetry property. Let « € X. For any y € X, x4 ~ 
y implies y ~ «x. In particular, for y = x we obtain x ~ x. This proves the 


reflexivity property. © 


The problem with the proof above is that by taking y = x in the symmetry 
property, we obtain the implication 


x~aximpliesa~cz, 


which is a tautology and which certain does not imply that x ~ x is true! (More 
abstractly, the implication P > P being true does not imply that the statement P 
is true!) 


8.4.3. Equivalence classes. Suppose that R is an equivalence relation on a set 


X. An is defined to be a set of the form 
(8.36) [ty] = {ye X:yrnah. 
Example 8.30 (Congruence classes are equivalence classes). The equivalence 


classes of the congruence equivalence relation on Z are called congruence classes. 


Non-Example 8.31 (Keep your friends close and your enemies closer). Consider 
the set X of all people and define the relation ~ on X byx~ y if and only if x and 


y are enemies. Then this relation is not transitive, for the enemy of your enemy 1s 
{your Jriend! © 


Moreover, if x is their own (worst) enemy, thenx ~ x. However, if there exists 
a person who is their own (best) friend, then the relation ~ is not reflexive! 


Exercise 8.13. Define a relation ~ on R” by 
(8.37) x~y_ if and only if |x| = lyl. 
Prove that ~ is an equivalence relation on R”. 


Describe the equivalence classes of ~. Hint: Spheres and, in one case, a point. 


376 8. Congruence Class Arithmetic, Groups, and Fields 


Exercise 8.14. Define a relation ~ on the set of real n x n invertible matrices by 
(8.38) Aw PAP, 


where P is also a real n x n invertible matrix. Prove that ~ is an equivalence 
relation on this set. 


We say that A and P~!AP are|similar matrices} (a.k.a. conjugate matrices, 
where being is a more general group theory notion). 


8.4.4. Partitions. Let X be a set. A collection {Ya}ae4 of subsets of X is 
called a{partition| of X if X is the disjoint union of these sets. That is, 


(1) 

(8.39) a | 
acA 

(2) Yan Ye #0 implies a = 8. 
Example 8.32 (Pre-images of a function form a partition). If f: X > Y, then 
(8.40) x= |) F-"@) 

yEeY 

is a disjoint union. That is, the collection {f~'(y)}yey is a partition of X. 


As a further example of this, using the remainder map r: Z > Z,,,, we set that 
the collection of sets 


(8.41) {r7'(a) = [a]m}acRm 
is a partition of Z. This set is the same as the collection Z,,, of congruence classes 
modulo m. 


More generally: 


Lemma 8.33. Given an equivalence relation ~ on a set X, the set of equivalence 
classes 


(8.42) E:={[a]:a0€ X} 
is a partition of X. 


Exercise 8.15. Prove Lemma |8.33 


Note that, for every x € X, for each y € X such that y ~ 2, [y] is considered 
as the same object as [2] in the collection €. For example, if X = Z and ~ is 
congruence modulo 17, then [3] is the same object as [20]. 


8.4.5. Quotients. 


Definition 8.34. We call the set of equivalence classes € the quotient set of X 
under the equivalence relation ~ and we also denote it by X/ ~. 


Example 8.35 (Set of congruence classes). Given a positive integer m, let X = Z 
and let ~ be congruence modulo m. Then the quotient set X/ ~ is Zm, the set of 
congruence classes modulo m. 
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8.4.6. Circles and tori. 


Example 8.36 (The circle as a quotient of an interval). Let X = [0,1], the closed 
unit interval in R. Define the equivalence relation ~ on X by 


(8.43) x~« for all x € [0,1], O~l. 
The quotient set is 
(8.44) X/w= Gell eee. U {0, 1}. 


That is, X/ ~ is the union of the singletons of points in the interior (0,1) of [0, 1] 
together with the 2-element set of the endpoints 0 and 1. So, we can “obtain” X/ ~ 
from X by “gluing” together 0 and 1. 


a aed < O > 


Figure 8.4.1. Left: The unit interval X = [0,1]. Right: A circle of length 1 
obtained by identifying together the endpoints of the unit interval X/ ~. 


Example 8.37 (The circle as a quotient of the real line). Let X = R and define 
an equivalence relation on R by 


(8.45) ony & «-yeZ. 
It is customary to write the quotient set R/ ~ as R/Z. Can you explain why R/Z 
is essentially the same as [0,1]/ ~ in Example [8.35/ 


Exercise 8.16 (A circle of circumference L). Let L be a positive real number. 
Define a relation ~ on R by 


(8.46) any ifand only if «—y€ LZ, 
where LZ := {kL :k € Z}. 
Prove that ~ is an equivalence relation on R. 


In what way is the set of equivalence classes of ~ the same as a circle of 
circumference L? 


Is any of this analogous to taking a string of length L and gluing its ends 
together? 


Example 8.38 (A 2-dimensional torus). Let X = [0,1] x [0,1], the unit square in 
the plane R?. Define an equivalence relation ~ on X by 
(x,y) ~ (a, y) for all (x,y) € X, 
(x,0) ~ (2,1) for all x € [0,1], 
(0,y) ~ (1,y) for all y € [0,1]. 
The quotient set T := [0,1] x [0,1]/ ~ is called a/torus| 
Equivalently, we may define an equivalence relation ~ on R? by (compare with 


Example [8.37) 
(8.47) (z,y)e(a+k,y+) for (x,y)€R? and k,leZ. 


We leave it to the reader to define a bijection between T and R?/ x. 
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a 


Figure 8.4.2. The quotient set T’ is obtained from the unit square by identi- 
fying the two sides labeled a together and the two sides labeled b together. 


Figure 8.4.3. The torus rendered as a surface in 3-dimensional Euclidean 
space. Author: Krishnavedala. Available under the Creative Commons CCO 
1.0 Universal Public Domain Dedication. 


8.4.7. Klein bottles. Re-imagining a/Tom Waits) quote: 


I’d rather have a Klein bottle in front of me than a fundamental 
trichotomy. 


Let us consider Example [8.38] with a twist. Namely, let X = [0,1] x [0,1], and 
define a relation ~ on X by 


(x,y) ~ (x,y) for all (x,y) € X, 
(2,0) ~ (1—2,1) and (1— 2,1) ~ (2,0) for all x € [0,1], 
(0,y) ~ (1,y) and (1,y) ~ (0,y) for all y € [0,1]. 
Let K := [0,1] x [0,1]/ ~ be the quotient set. The set K is called the Klein bottle 


Exercise 8.17. Prove that ~ is an equivalence relation on X. 
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Equivalently, we may describe the Klein bottle as a quotient of R*. Define the 
equivalence relation ~ on R? by 
(8.48) 


ee ee 
(2,y) & ( 1)’x 4 : ) +hy+e) for (x,y) ER? and k, fe Z. 


For example, if @ is even, then (2, y) © (c +k,y+ @) for all k € Z. If @ is odd, then 
(x,y) (l-a+k,y+ @) for all k € Z. We leave it to the reader to check that = 
is an equivalence relation and to define a bijection between K and R?/ ~. 


a 
b b 
a 
Figure 8.4.4. The quotient set K is obtained by identifying sides of a square 


according to the arrows. For example, the bottom side is flipped horizontally 
when identified with the top side. 


Exercise 8.18. Convince yourself that (in some sense!) the quotient set K can 
be rendered as the surface in Figure [8.4.5| Hint: The Klein bottle link above shows 
some steps. 

As a further example, we have the Mobius band, which is the quotient set 
(8.49) M = ((0,1] x R)/&%, 
where ~ is defined by (8.48) with k = 0 and ¢ € Z. 


Equivalently, we may define Mobius band as the quotient of [0, 1] x [0,1] by the 
equivalence relation ~ defined by 


(x,y) = (@,y) for all (x,y) € X, 
(2,0) ~ (1—2,1) and (1— 2,1) ~ (a,0) for all x € [0,1]. 
This is how we make Mobius bands in real life; see Figure [8.4.6] 


We leave it as an exercise for the reader to show that the projective plane RP? 
(see may be considered as the quotient of [0,1] x [0,1] by the equivalence 
relation ~ defined by 


(x,y) ~ (x,y) for all (x,y) € X, 
(2,0) ~ (1—2a,1) and (1— 2,1) & (2,0) for all x € [0,1], 
(0,y) ~ (1,1—y) and (1,1—y) ~ (0,y) for all y € [0,1]. 
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Figure 8.4.5. A Klein bottle made with gnuplot 4.0. Wikimedia Commons, 
author: Tttrung. Licensed under the terms of the GNU Free Documentation 
License, Version 1.2 or any later version. 


Figure 8.4.6. A Mobius band. Credit: Wikimedia Commons. Photo by 
David Benbennick. Licensed under Creative Commons Attribution Share 


Alike 3.0 Unported (https: //creativecommons.org/licenses/by-sa/3.0/deed.en) 


license. 


8.5. Elementary abstract algebra 


Throughout this book, we have been considering algebraic structures. In this sec- 
tion we will consider various basic examples and their abstractions. Roughly speak- 
ing, these abstractions are based on “how many” properties we decide to impose 
and of course the nature of these properties. 


8.5.1. The integers and other examples of algebraic structures. The most 
basic algebraic structure we have been discussing is the integers. The integers Z is 
a set and we have a pair of on Z of denoted by +, 
and multiplication, denoted by - or x. That is, + assigns to an ordered pair of 


integers (a,b) an integer a+ b which is their sum. 
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The integer 0, read as “zero”, is special in that it is the additive identity: 
For any integer a, we have 


(8.50) O+ta=a+0=a. 

Addition is commutative: For any integers a and 8, 

(8.51) a+b=b+a. 

Addition is associative: For any integers a, b, and c, 

(8.52) (a+b) +c=a+(b+c). 

For each integer a there exists an additive inverse —a satisfying 
(8.53) a+ (—a) =(-a)+a=0. 


In general, a set with a binary operation which has an identity element, is 
commutative and associative, and for which each element has an inverse is called 
an abelian group (see below the abstract definition). In particular, the 
integers Z with addition + is an abelian group. We remark that “abelian” is a 
synonym for “commutative”. 


Here is another example of an abelian group: 


Example 8.39. Let X = {e,o0}, which is a set with two elements, one called “e” 
and one called “o”. Define the binary operation © taking an ordered pair (a,b), 
where a,b € X, toa@b, by the addition table in Table [85.1] 


Table 8.5.1. Addition table for the abelian group ({e, o}, @). 


That is, the addition table defines 


(8.54) e®e=e, e®o=o0, oGe=o0, oPo=e. 


One can see by inspection and a short calculation that e is the additive identity 
and © is commutative and associative. We also see that each of e and o is its own 
inverse. The color coding in is such that purple means red and blue. Note 
also that explains how we should read the addition table in Table [8.5.1] 


Exercise 8.19. Verify the statements above that the set X with the “addition” 
binary operation ® given by the table in Example [8.39] satisfies the properties that 
e is the additive identity and ® is both commutative and associative. And each of 
e and o is its own inverse. Thus, (X,@) is an abelian group, as defined in 
below. 


Exercise 8.20. Write out an addition table for parity, where the two elements 
being multiplied are “even” and “odd”, using the theorems in How does your 
addition table for parity compare to the addition table in Table [8.5.1)? Try to be as 
specific as possible with your comparison. 
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Exercise 8.21. Write out a multiplication table for parity, where the two elements 
being multiplied are “even” and “odd”, using the theorems in 41.2.3} 


Now we turn our attention to multiplication. For the integers, multiplication - 
assigns to an ordered pair of integers (a,b) an integer a-b which their product. For 
example, it assigns 15 to (3,5). 

The integer 1 is special in that it is the multiplicative identity: For any 
integer a, we have 
(8.55) l-a=a-l=a. 

Multiplication is commutative: For any integers a and 8, 
(8.56) a-b=b-a. 
Multiplication is associative: For any integers a, b, and c, 
(8.57) (a: b)-c=a-(b-c). 

We say that integers a and 6 are inverses of each other if 
(8.58) a-b=b-a=1. 

Since 1-1 = 1 and (—1)-(—1) = 1, 1 has the multiplicative inverse 1, and —1 has the 
multiplicative inverse —1. But no other integers have multiplicative inverses! That 
is, if a is an integer that is neither 1 nor —1, then there does not exist an integer 
b satisfying (8.58). That is the bad news. But the good news is that addition 
and multiplication cooperate to give some nice properties. Namely, we have the 
distributive property: For any integers a, b, and c, we have 

(8.59) a:(b+c)=a-b+a-c. 

Since multiplication is commutative, we obtain “right” distributivity as well as the 
above “left” distributivity: 

(8.60) (b+c)-a=b-at+c-a. 


More generally, when a set with a pair of binary operations + and - satisfies 
the properties above, we say that it is a commutative ring! In particular, (Z,-+,-) 
is a commutative ring, called the ring of integers. So, by a ring, we do not mean 
something you wear to signify that you are married, or the sound a doorbell makes. 
Rather it indicates the presence of two binary operations which we call addition 
and multiplication on a set satisfying the properties above. 


Exercise 8.22. Let © denote the multiplication you defined for parity in Exercise 
Show that the set X = {e,o} with the binary operations of “addition” ® 
and “multiplication” ® satisfies all of the same axioms stated above that the set of 
integers Z with addition + and multiplication x satisfies. For this reason, we call 
(X,®,®) a commutative ring. 


Solved Problem 8.40 (Multiplication on the set of remainders modulo 3). Define 
the set R3 := {0,1,2}. Define the binary operation © on R3 by 


ab if ab < 3, 
8.61 b:= 
ee) ie re if ab > 3. 


Write out the multiplication table for (Rs, ©). 
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Solution. It is easy to see that if a € Rs, then 
(8.62) a©0=00a=0, a@®@l=1Oa=a. 
The only remaining product to compute is 
(8.63) 202=2-2-3=1. 
So, piecing all of this together, we obtain that the multiplication table 


© 0 1 2 

0;000 061 062 
1/160 161 162 
2)/2600 20601 262 


is given by 


NF O]® 
So: So 
mF Ole 
Bw o];w 


We remark that this is the same as congruence class multiplication on Zs. 
Exercise 8.23. Let m be a positive integer. Define the set 

(8.64) Rm := {0,1,2,...,m— 1}. 

Another way to describe this set is 

(8.65) Rn = {aE Z:0<a<mb}. 

Define the binary operation ® on Ry, by 


aobm={ a+b ifatb<m, 


xe) a+b-m ifat+b>m. 


Write out the addition tables for Rog and R3, both with the binary operation @. See 
Table [8.5.1] for an example of an addition table. 


Note that the binary operation @ on R,, above is the same as congruence class 
addition on Zn. 


Exercise 8.24. Let m be a positive integer. Prove that if a,b € Rm, thna®be 
Rm. This just says that indeed ® is a (well-defined) binary operation on Rm. 


Exercise 8.25. Show that the binary operation © on Rm is commutative and 
associative. How does (Rm, @®) generalize Example [8.39]? 


In this subsection we have considered the ring of integers (Z,+,-). If we only 
consider the additive structure + on Z and forget about multiplication -, we have 
what is known as a group structure on Z. 


384 8. Congruence Class Arithmetic, Groups, and Fields 


8.5.2. What is a group anyway? In general, we have the following. 


Definition 8.41. A set G together with a binary operation © : G x G > G is 
called a{group|if it satisfies the following axioms: 


(1) (Associativity) For any x,y, z € G, 


(8.67) (xOy)Oz=2O(y©2z). 

(2) There exists an identity element e with the following property: For any 
reEG, 

(8.68) eOur=xOe=e. 


(3) Each x € G has an inverse element, which we denote by 2, satisfying 


(8.69) roe 2 @r=-<, 


8.5.3. Types of groups. 
Definition 8.42. We say that a group (G, ©) is a/finite group if G is a finite set. 


We call its cardinality |G| the order] of the group. 
If G is not finite, then we say that it is infinite. 


Definition 8.43. We say that a group (G, ©) is|abelian|if © is commutative; that 
is, for every x,y € G, 
(8.70) rOy=yOr. 


Definition 8.44. A group (G,©) is a|cyclic group if there exists an element a € G 
such that each element g of G is equal to at least one of the following forms: 


(1) g=e. 
(2) o=a =aOaO-:-O4, where k € Zt. 
$$ —_—_ 


k times 
(3) g=(a 7) := a_'@a!@---@©a_!, where £€ Zt. 


é times 


We call the element a a\generator of G, and we write G = (a) (cf. (8.110) below). 


We next consider examples of the definitions above. 


8.5.4. Jostling examples and non-examples of groups. 


Example 8.45 (The additive integers). Let G = Z and let © = +, the usual 
addition of integers. This is an infinite abelian cyclic group. The identity element 
is given by e = 0. The inverse is given by «~! = —x. A choice of generator for Z 
is 1. The only other possible choice of generator for Z is —1. 


Taking the generator to be 1, we have: 


(1) Ifg > 0, then g= 19 :-=1+1+---+1. 
—__—_—— 


g times 
, = (jC) S77 Stee 
(2) If g < 0, then g = (-1) si oa al Oa 1. 


—g times 
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Non-Example 8.46. Let G = Z and let © =- be multiplication. Then (G,©) = 
(Z,-) is not a group. This is because, although e = 1 is an identity element, the 
element 0 does not have an inverse. That is, there does not exist any element, call 
this mythical object 0~', such that 

(8.71) 0-0" =1, 


Note that neither does 2 have an inverse since $ is not an integer. In fact, the only 


integers with multiplicative inverses are 1 and —1. 


Example 8.47 (Set of congruence classes with addition). Now let m be a positive 
integer, and let Z,,, be the set of congruence classes modulo m as defined in (8.15). 
Let + be the binary operation on Z,,, defined by (8.18al). Then 


(Zim; +) 


is a finite abelian cyclic group of order m, where the inverse is given by ([a]m)~+ = 
[—@]m- 


Exercise 8.26. Prove that, as claimed in Example[8.47 for everym € Zt, (Zm,+) 
is an abelian group. 


Non-Example 8.48. Let - be the binary operation on Z,, defined by (8.18c), where 
m > 2. Then 


(Zm, 2 
is not a group. 


Exercise 8.27. Prove that, as claimed in Non-Example [8.48] for every m € Zt 
with m > 2, (Zm,-) is not a group. Hint: The element [1]m is the identity, but 
what element has no inverse? 


Example 8.49 (Non-zero congruence classes mod 5 with multiplication). Let 
(8.72) Zs ‘= {[1]s, [2]s, [3]s, [4]5} 


be the set of non-zero congruence classes modulo 5. From the multiplication table 
in Figure|8.1.3) we see that Z> with the binary operation - is an abelian group. For 
example, the identity element is [1]; and the list of inverses is 


(8.73) s'=(ls, 2Is*=[]s, Bls’=B2ls, [4]5* = [4ls. 
Non-Example 8.50. Let 
(8.74) Zi := {[1a, 2]a, [3]a}- 


We see that Le with the binary operation - is not a group. In particular, while [1]4 
is still an identity element, the element [2]4 does not have an inverse. 


We can rectify this by defining instead 
(8.75) Zi = {[1a, [Ja}. 


In this case, (Zj,-) is an abelian group! 


Exercise 8.28. Prove that (Z4,-) is an abelian group. 
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Exercise 8.29. Define the dihedral group| Dz as follows. Its four elements are the 
functions from R? to R? ~ by 


(8.76) : (2, y) + (2, y), 
(8.77) a: (z,y) +> (—2,y), 
(8.78) b: (x,y) + (x, —-y), 
(8.79) c: (@,y) > (—a,—-y). 


That is, 1 = Ip2 is the identity function, a is a reflection about the y-axis, b is a 
reflection about the x-axis, and c is rotation by 180° (a.k.a. the antipodal function). 
Define multiplication to be the composition of functions. For example, we have 


(8.80) a-b:=aob: (x,y) (x,y) 4 (—2,-y), 


so we see thata-b=c. 


(1) Write out the multiplication table for (D2,-). 
(2) Prove that (Do2,-) is a group. 
Exercise 8.30. Let m be a composite number, and define 
(8.81) ZH, = {lilm :0<i< my}. 
Prove that (Z%,-) is not a group. Hint: Not only are there element(s) with no 


inverse(s), congruence class multiplication - on the set Z# is not well-defined. 


Example 8.51 (Non-zero congruence classes mod a prime with multiplication). Let 
p be a prime. Define Z> to be Z, — {[0]p}, that is, the set of non-zero congruence 
classes modulo p. Then 
is an abelian group. 

(1) The identity element is [1]). 


(2) We claim that each element [a], € Z> has an inverse. To see this, recall that 
a and p are coprime and hence by Theorem there exists b € Z such that 


8.82) ab = 1 mod p. 
This implies that 
8.83) [a]p - [Bly = [Oly [alp = [L]p- 


(3) The rest of the group axioms are easy to verify. 


Non-Example 8.52. The set of complex numbers C with complex multiplication 
-, as defined in §8.7.3.1] is not a group. For 0 does not have an inverse! 


Example 8.53 (Non-zero complex numbers with multiplication). Let C* := C — 
{0} be the set of non-zero complex numbers. Then (C*,-) is a group. The inverse 
of a non-zero complex number z is 

= z 


where Z is the complex conjugate defined by (8.171). 


Non-Example 8.54. Can we see another example instead? © 
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Example 8.55 (General linear groups of matrices). Important examples of groups 
are matrix groups. A couple of examples are: 


(1) The (real) \general linear group) GL(n,R), which is the set of real 
nm X n matrices with multiplication given by matrix multiplication, Recall 
that a|square matrix|is if and only if its is non-zero. 

(2) The\complex general linear group|GL(n,C). This is the set of invertible com- 
plex n x n matrices with multiplication given by matrix multiplication. 


We will not discuss these groups in detail in this book. 


Exercise 8.31. Prove that if a group (G,©) is cyclic, then it is abelian. 


8.5.5. Basic properties of groups. Let (G, ©) be a{monoid} that is, the binary 
operation © is associative and there exists an identity element e. Let a © G. We 


say that b is alright inverse of a if 
(8.85) a®b=e. 
We say that c is a left inverse of a if 


(8.86) cOa=e. 


Suppose that 6 is a right inverse of a and that c is a left inverse of a. Then we 
have 


(8.87) b=e-b=(c-a)-b=c-(a-b)=c-e=c. 


That is, for every element in a monoid, if it has both a right inverse and a left 
inverse, then they are equal. 


Exercise 8.32. Let (G,-) and (H,*) be groups. Their product group is the set 


G x H with multiplication o defined by 


(8.88) (91, h1) © (ga, ha) = (gi « ga, hi * he). 


Prove that this product group is indeed a group. 


8.5.6. Subgroups (less is more). Let H be a subset of a group (G,®). We 
say that (H,©) is a\subgroup|of (G, ©) if (H, ©) itself is a group. By necessity, this 


means: 


(1) A is closed under multiplication; that is, for every x,y € H, xy € H. 
(2) e € H, where e is the identity element of G. 
(3) If « € H, then x! € H, where x~! is the inverse of x in G. 


Often we omit the multiplication symbol © in the notation for a group. 
If H is a subgroup of G, we denote this by 


A<G. 
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Example 8.56. Let m be a positive integer. Then (mZ, +), where 


(8.89) mZ := {mu : a € Z}, 
is a subgroup of the additive group of integers (Z,+). Observe that, as sets, 
(8.90) mZ = [O|m, 


the congruence class of 0 modulo m, which is defined in Definition [8.1] 
For example, 2Z is the additive group of even integers. 
Observe that if a positive integer r is a multiple of m, then (rZ, +) is a subgroup 
of (mZ, +). 
Exercise 8.33. Let E and O denote the set of even and odd integers, respectively. 
(1) Is (E,+) a subgroup of (Z, +)? 
(2) Is (O,+) a subgroup of (Z, +)? 
(3) Is (Z=,+) a subgroup of (Z,+)? 


Example 8.57. Let m = ab be a composite positive integer, where a,b > 2. Then 
([a]m) := {[ka]m :0< k < b} 

is a subgroup of (Zm, +) of order b (cf. (8-110)). 

Exercise 8.34. Prove the statements in Example [8.57] 


We say that a subgroup H of a group G is non-trivial if H 4 {e}, where e is 
the identity element. 


Lemma 8.58 (Classification of subgroups of Z). If (H,+) is a non-trivial subgroup 
of (Z, +), then there exists a positive integer m such that 
(8.91) H=mlZ. 


Proof. Since the subgroup #H is non-trivial, there exists a non-zero integer a € H. 
Since H is a group, —a € H. Define b = |a|. Then b€ HNZt. Since HN Zt isa 
non-empty set of positive integers, by the well-ordering principle it has a minimum 
element. Let m denote the minimum element of the set HN Z™. 


Claim. H = mZ. 


Firstly, since m € H and H is a group, we have mZ C H. 


Secondly, we prove that H C mZ. Suppose for a contradiction that there exists 
an integer c with c€ H andc ¢ mZ. By the Division Theorem, there exist integers 
q and r satisfying 


(8.92) c=mqtr and 0<r<m. 
The reason why r > 0 is that c ¢ mZ. Since c,m € H, we have 
(8.93) r=c—mq€ H. 


However, this contradicts m being the minimum element of HNZ* since 0 < r < m. 
Since we have a contradiction, we conclude that H C mZ. 


Exercise 8.35. Show that the unit complex numbers (the set of z € C satisfying 
|z| = 1) with complex multiplication, form a subgroup of (C*,-). 
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8.5.7. Finite groups. 


Theorem 8.59 (Lagrange). If G is a finite group and if H < G is a subgroup of 
G, then their orders (see Definition [8.42] have the property that 
|H| divides |G]. 


Example 8.60. Let m be a positive integer and let a be a positive divisor of m. 


Then ({a]m) is a subgroup of Z,, of order ™, which is a positive integer dividing 


m. For example, taking a = 3 and m = 15, we see that 
([3]as) = {[0]15, [3]i5, [6]as, [9]as, [12]15} 


is a subgroup of Z15 of order 5, where the group multiplication is given by congru- 
ence class addition. 


We now discuss some concepts we will need for the proof of Lagrange’s Theorem, 
which will be given at the end of this subsection. Using the subgroup H, we can 
define an equivalence relation on G by 


y ~ x if and only if there exists h € H such that y = zh. 
We verify that ~ is: 

(1) Reflexive: x ~ x since x = ze for all x € G and the identity element e of G is 
in A. 

(2) Symmetric: If x ~ y, then there exists h € H such that x = yh. This implies 
that y = ah~!. Since H is a subgroup, h~! € H and hence y ~ «. 

(3) Transitive: If « ~ y and y ~ z, then there exist h,k € H such that x = yh 
and y = zk. Thus x = zkh. Since H is a subgroup, kh € H. Thus z ~ z. 


If x € G, we denote the equivalence class of x by 
(8.94) eH =(t]={yeG:yr~nus={ch: he H}, 
and we call it the (left) |coset|of x. Since ~ is an equivalence relation, the collection 


of distinct cosets partition the set G. 


Example 8.61. Although we are presently interested in finite groups, for simplicity 
we first consider the infinite group (Z,+). Let H be a non-trivial subgroup of Z. 
By Lemma[8.58] we have H = mZ for some positive integer m. 


Let x € Z. Then the left coset 
(8.95) cH ={xt+h:hemZ}={x+mk:k € ZS = [tlm 
is the congruence class of « modulo m. 
Example 8.62. Recall from Example 8.47] that (Z,,+) is a finite abelian cyclic 


group of order m. Assume, more generally, that (G,©) is a finite abelian cyclic 
group of order m. Suppose that a € G is a generator of G. Then 


(8.96) Ga{as0<k< mf}, 


where a® := e is the identity element. 


Let c be a positive divisor of m, and define 
(8.97) H := (a°). 
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Let b := m/c, which is a positive integer. Then 


(8.98) H= {a9 :0<j < Od}. 
The distinct cosets are 
(8.99) aH, O<i<e, 


Now let us return to the general case of a subgroup H of a finite group G. We 
leave it as an exercise that the function 


(8.100) fe: H> cH 

defined by 

(8.101) fe(h) =ch forhe H 

is a bijection. From this and Theorem |5.62| we conclude that for each x € G, 
(8.102) |cH| = |A| 


(is independent of x). 


Exercise 8.36. Prove that f, : H — cH defined by (8.101) is a bijection. 


The of the subgroup H of the finite group G is the number of distinct 
cosets and is denoted by 


(8.103) IG: HI. 


Proof of Theorem Since there are |G: H| cosets, since each coset has || 
elements, and since the cosets partition G, we see that 

G 
(8.104) IG| =|H|-|G: HH]; that is, IG: |= 
Since |G : H| is a positive integer, we conclude that |H]| divides |G|. This completes 
the proof of Theorem [8.59] 


Exercise 8.37. Find all of the subgroups of the dihedral group (Do,-) defined in 
Exercise |8.29 


8.5.8. Normal subgroups and quotient groups. We say that a subgroup H of 
a group G is normallif for all g € G we have 
(8.105) gH = Hg, 


where Hg := {hg : h € H} is the right coset of g. Equivalently, ghg~! € H for 
all g € G. If G is abelian, then any subgroup of G is normal. 


Given a normal subgroup H of G, we can define the G/H to be 


the set of left cosets: 


(8.106) G/H = {gH :g€G}. 
The group multiplication on this set is defined by 


As an exercise, you may check that this is a well-defined group by using the as- 
sumption that the subgroup H is normal. 
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Example 8.63. Let m be a positive integer. Since mZ is a normal subgroup of the 
additive group of integers (Z,+), we have the quotient group (Zm,+) := Z/mdZ, 
where mZ is defined by (8.89) as mZ = {mz : x € Z}. This quotient group is 
naturally isomorphic to the additive group of congruence classes, where a left coset 
a(mZ) corresponds to the congruence class [a] for a € Z. 


For an example for the problems that arise when we try to define the quotient 
group using a subgroup that is not normal, see Non-Example [8.78] below. 


8.5.9. Subgroup generated by an element. Let (G,©) be a group. Given 
a € G, we define the powers of a as follows. The positive powers are defined by 


(8.108) a t=0, oc MHaGn of Hageos, <u, a =a©aG®::-Oa. 
—___ —__ 
k times 
We define the negative powers by 
(8.109) a Sa Oa O-On* 
———__ 
k, times 


for k € Z+. The 0-th power is a®° := e. 
Let (G,©) be a group and let a € G. The subgroup of G generated by a is 


(8.110) (a) = {a*:k € Z}. 
Note that (a) is a cyclic group. 


Example 8.64. If m is a positive integer in the additive group of integers (Z, +), 
then 


(8.111) (m) = mZ. 


Lemma 8.65. If (G,©) is a finite group and if a € G, then there exists a least 
positive integer n such that 


(8.112) a” =e. 
Moreover, 
(8.113) n = |(a)|. 


The order n of the group (a) is also called the order of the element a. 


Proof. To show that there exists a least positive integer n satisfying (8.112), by the 
well-ordering principle (see {4.1.4) it suffices to show there exists some n satisfying 


Suppose for a contradiction that there does not exist any n € Z* such that 
a” =e. Then we claim that 


(8.114) a’ #a) for i,j € Z* satisfying i Aj. 


To see this, suppose i,j € Z* satisfy at = a). Without loss of generality, we may 
assume that i < 7. We compute that 


(8.115) e= alg — a} (a~')' = a)? 
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Since 7 — i > 0 and since there does not exist any n € Zt such that a” = e, we 
obtain that 7 — i = 0; that is, i = j. We conclude that some n € Z™ satisfies 
(8.112). 

Let n be the least positive integer satisfying (8.112). Then 
(8.116) (Oo) =4ea a ot 


where the elements on the right-hand side are all distinct. Therefore |(a)| = n. 


Example 8.66. Let m be a positive integer, and let (Z,,,,+) denote the additive 
group of congruence classes modulo m. Then the order of an element [a], is equal 
to aaa We leave it as an exercise for you to prove this. 


Theorem 8.67. Let (G,©) be a finite group, and leta € G. Then 
(8.117) alGl =e. 


Proof. We have that (a) is a subgroup of G. Hence 
|(a)| divides |G]. 
On the other hand, by Lemma\[8.65] 


(8.118) all = e, 

Since |(a)| divides |G], there exists a positive integer k such that 
(8.119) \(a)| k = |GI. 

Therefore 

(8.120) giGl = (al(olyr =e =e. 


8.5.10. The multiplicative group Z*,. Recall from (8.28) that Z*, is the set 
of invertible congruence classes modulo m. Equivalently, 


(8.121) Zr, = {alm € Zm : gcd(a,m) = 1}, 

and this set is well-defined since by Exercise [8.6] we have the following: 

If a,b € Z are such that [a]m = [b]m, then gcd(a,m) = 1 if and only if gcd(b, m) = 1. 
a m = p isa prime, the definition in agrees with the definition in 


Lemma 8.68. Let m € Zt. We have that (Z*,,-) is an abelian group, where 
multiplication - is given by congruence class multiplication. 


Exercise 8.38. Prove Lemma|8.68| Hint: The main point is that any element of 
Zx, has a multiplicative inverse. 


Example 8.69. (1) We have 


(8.122) Zé = {[1le, [5]et, 
where the multiplication table is 

(Zi,-) | Mo [Blo 

(8.123) [Je | [lle [le 

[le | [Je [1e 
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(2) We also have 


(8.124) Zio = {[1]10, [3]10, [Z]10, [r0}, 
where the multiplication table is 
(Zio,*) | Hho [Bho [Zio [910 
[110 | [10 [3l10 [710 [9]10 
(8.125) [3]10 3jio [910 [110 [Z]10 
[7l1o | [710 [lio [910 [3)a0 
[910 | [910 [Zo [S}ro [1]10 


Exercise 8.39. As in Example [8.69] for Z% and Zj,, write out the multiplication 
tables for both Zi, and Zj,. 


Exercise 8.40. Let m € Z*, and suppose that R is a reduced residue system 
modulo m as defined in {6.10.1} Prove that 


(8.126) {la]jn :a€ R} =Z*. 


8.5.11. Another proof of Fermat’s Little Theorem. The following gives an- 
other proof of Fermat’s Little Theorem [6.63] 


Exercise 8.41. Let p be a prime number. Recall from Example that (Zz,-) is 
a group. 
(1) Let [a], € Z5. Consider the subgroup (|a]p). Explain why k := |(|a]p)| divides 
p-l. 
(2) Deduce that [a?~*], = ({alp)?~! = [1]p- 
8.5.12. Proof of Euler’s Theorem. Let nm > 2 be an integer. Recall that 


Euler’s totient function y : Z* + Z* is given in Definition by y(n) is the 
number of integers k satisfying 1 < k <n and gcd(k,n) = 1. Recall also that 


(8.127) R* := {ae Ny» : gcd(a,m) = 1}. 


We have seen that the map from R*, to Z*, defined by a +> [a]m is a bijection. In 
particular, 


(8.128) IZ%,| = [Ril 
By (8.128), we have 
(8.129) p(m) = |Rm| = |Znl- 


Recall Euler’s Theorem [6.79] (generalizing Fermat’s Little Theorem), which we 
stated earlier and we can now prove using the group theory we have developed. 


Theorem 8.70. If positive integers a and m > 2 are coprime, then 
(8.130) a?"™ = 1 mod m. 
In other words, 


(8.131) [a] = [1}m- 
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Proof. By Lemma|8.68] (Z*,,-) is a group. Since a and m are coprime, we have 
[alm € Z*,. Thus, by Theorem [8.67] we have 


(8.132) [a] 2™) = fal! = [tin 


is the identity element of Z*,. 


8.5.13. Group morphisms. Let (G,-) and (H,*) be groups. A function f : 
G- Hisa if f “preserves” the multiplication structures; that 


is, 
(8.133) f(a-y) = F(z) * f(y). 
If a homomorphism f is an injection, then we say that f is almonomorphism| 


If a homomorphism f is a surjection, then we say that f is an 
If a homomorphism f is a bijection, then we say that f is an isomorphism]| 


and (G,-) and (H, *) are isomorphic (as groups). 

Example 8.71. The group (mZ, +) in Example [8.56]is isomorphic to (Z, +). 
Exercise 8.42. Prove that for each positive integer m, there are exactly two iso- 
morphisms from (Z,+) to (mZ, +). 


Exercise 8.43. Prove that (Zj,-), where Zi is as in (8.75), is isomorphic as a 
group to (Z2,+). Recall that group isomorphisms are defined in {8.5.13] 


Example 8.72. Let m be a positive integer, and let 1 denote the 
Let 


(8.134) Zee eli ke Zh. 


The set Z,, is closed under complex multiplication -, and (Z,,,-) forms a finite 
group of order m, called the group of m-th roots of unity. 


Define the function 


(8.135) fila Ze 

by 

(8.136) f([Klm) = e827*/™, 
Then (see Exercise below) 

(8.137) f : (@m,+) > (Zm,;°) 


is not only a well-defined homomorphism, it is an isomorphism! 
Observe that for each k, 

(8.138) Cl a eae 

For this reason, we call each e¢27*/™ a\root of unity) 


For example, Z3 consists of the 3-rd roots of unity: 
i2n-1/3 _ 1 si v3; ei 2m2/3 i. V3. 


2 2 2 2 


Exercise 8.44. Prove that, as claimed above, the function f : (Zm,+) > (Zm,°) 
defined by (8.137) is a well-defined group isomorphism. 


(8.139) ud ae ae 
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efti/T 
i . 
e27t/7 
ett/T 
< e2 
e8tt/7 
1271/7 
elOni/7 


Figure 8.5.1. The 7-th roots of unity. The group (Z7,-) is isomorphic to (Z7,+). 


8.5.14. The symmetric group. Let X be any set. A/permutation|o of X is 


defined to simply be a bijection 0: X > X. 


Let S(X) be the set of bijections from X to itself. Recall from Theorem 3) 
that the composition of two bijections is a bijection. Thus, composition o is a binary 
operation on S(X). 


Lemma 8.73. For any set X, (S(X),0°) is a group, called the| symmetric group| of 
Xx. 
Proof. We verify the group properties in Definition [8.41] 


(1) By Theorem .39{1), composition is associative. 
(2) Let 1x denote the identity map. By Theorem 1), we have 


ly cog =aolx =a. 


(3) Let o € S(X). Since a: X > X is a bijection, by Theorem [5.53} there exists 
an inverse 0~!: X + X. This is indeed the group inverse: 


(8.140) cog '=oq 'oag=1x. 


Example 8.74. Let X = N3 = {1,2,3}. Let us express a function f : N3 > N3 by 
the 2 x 3 matrix 


1 2 3 
ae (ray re) 40))- 
For example, the matrix 
(3.12 Ge) 


denotes the function g defined by g(1) = 2, g(2) = 3, g(3) = 2. Observe that g is 
not a bijection and hence is not a permutation. So g ¢ (Ns). 
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Using the notation above, we can list all of the elements of S(N3) as follows: 
(8.143) 


1 2 3 1 2 8 1 2 8 1 2 3 1 2 8 1 2 8 
1 2 3/’\1 3 2/7’? \2 1 3/’ \2 3 1/7’ \38 1 2)’ \38 2 1)° 
So |S(N3)| = 6. 


Example 8.75. More generally, let n be a positive integer, and let X be a finite 
set with n elements. For this discussion, we only care about the cardinality of X, 


so for convenience we choose X to be the set N, = {1,2,...,n} consisting of the 
first n positive integers. 

We denote 
(8.144) Sn = S(Nn), 


which is called the symmetric group on n elements (or over n symbols). 


By definition, a permutation o of the set N,, is defined simply to be a bijec- 
tion a0 : N, — N,. Observe that o is a bijection if and only if the n elements 
a(1),0(2),...,0(n) are distinct. 


We can count the number of bijections as follows: 


(1) Firstly, there are n choices for o(1): namely any one of 1,2,...,n. 


(2) Secondly, independent of the choice of o(1), there are n — 1 choices for (2) 
since o being bijective implies that (2) 4 o(1). So a(2) can be any element 
in the set 


N, — {a(1)}. 
(3) Thirdly, independent of the choices of o(1) and a(2), there are n — 2 choices 
for o(3) since o being bijective implies that o(3) is an element in the set 


Nn oo {a(1), a(2)}. 


(4) Continuing in this way (and no, we will not try to formalize this argument by 
induction!), we see that for the n successive choices of a(i), 1 <i <n, since 
o is a bijection, the numbers of possible choices are shown in Table [8.5.2] 


Table 8.5.2. The top row entries are the elements of Ny: 1, where 1 <i <n. 
The bottom row entries are the corresponding number of possible choices for 


a(t). 


choice | 1 2 3 + n—-lin 
number of choices | n n-l n-2 -::- 2 1 


(5) By the multiplicative law, we see that the cardinality 
(8.145) |S(N,,)| =n-(n—1)-(n—2)---2-1L=nl. 


Alternatively, (8.145) follows from Corollary [7.38 
This also agrees with Example [8.74] where we saw that |S(N3)| = 6 = 3!. 
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We say that a permutation 7 of a set X is a if there exist two 


distinct elements x7, and x2 of X such that 
(8.146) T(w) =a if ce X—{a1,22}, T(a1)=%2, T(x2) = 21. 
That is, 7 switches two elements while keeping all other elements the same. 


We end this subsection with two fundamental results, which we will not prove. 


Fact 8.76. If X is a finite set, then any permutation of X is the composition of 
transpositions. 


Fact 8.77 (Parity of permutations). Let o be a permutation of a finite set X with 


|X| > 2. Ifo is the composition of k transpositions, then either 


(1) k is always even, in which case we say that o is an even permutation, or 


(2) k is always odd, in which case we say that o is an odd permutation. 


Non-Example 8.78. Using the symmetric group S3, we can give a simple example 
for why we need a subgroup to be normal in order to define the quotient group. Let 


a: 
i,j,k € Ng be distinct. Let 7; denote the tranposition (; J 4 , let Tiz~ denote 
the permutation ‘ “ , and let e = 1y, denote the identity element. Consider 


the subgroup H = {e,712} of Ss of order 2. Using that multiplication in the group 
S3 is given by composition bijections of N3, we see that the left cosets of H in S3 
are 


eH = T12H = A, 
1 123H = 713H = {7123, 713}, 
1132H = T23H = {7132, T23}- 


From this we can easily deduce that left coset multiplication is not well-defined. 
For example, computing multiplication using different representatives of the same 
cosets, we have 


71237132 H = A, whereas 713723 = 1132H x A. 
8.5.15*. Applications of group theory. Group theory has been applied to 
study the mathematics of Rubik’s cube and the mathematics of We invite 


you to click on the Wikipedia links to learn more. 
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Recall that we discussed the ring of integers in In general, we have: 


Definition 8.79. A ring|is a set R with binary operations + called addition and 
- called multiplication such that the following hold: 


(1) (R,+) is an abelian group (see Definition [8.43). 
(2) (R,-) is a monoid. 
(3) (Distributivity) For all x,y,z € R, 
(8.147) “-(ytz)=a-y+n-z, (y+2)-e=y-Ute-a. 
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For example, Z, Q, and R, with the usual addition + and multiplication -, are 
all rings. 


A ring is an integral domainjif « 4 0 and y £ 0 implies that x-y #0. 
Lemma 8.80 (Cancellation property). For anyx 40, ifu-y=a-z, theny =z. 


Proof. Exercise! 


For example, (Z,+,-) is an integral domain; in fact it is the prototypical exam- 
ple. 


If p is a prime, then (Zp, +,-) is an integral domain. 


Example 8.81. The ring Z[z] of polynomials in a single variable with integer 
coefficients is an integral domain with the usual addition and multiplication. We 
leave it to the reader to check the ring structure. Moreover, clearly the product 
of two non-zero polynomials with integer coefficients is a non-zero polynomial with 
integer coefficients. 


On the other hand, if m is a positive integer, then (Z,,,+,-) is a ring. However, 
unless m is a prime, it is not an integral domain. Indeed, if m = ab where a,b > 1 
is a composite number, then [a]m 4 [O]m and [b]m 4 [O]m, but [alm + [b]m = [O]m.- 


Definition 8.82. Let (R,+,-) be a ring and let S be a subset that is closed under 
both + and -; that is, the sums and products of elements in S remain in S. We say 
that (S,+,-) is a subring| if it is a ring. By the uniqueness of the multiplicative 
identity, the multiplicative identity of S is equal to the multiplicative identity of R. 


Example 8.83. The ring (Z,+,-) of integers has only itself as a subring. Indeed, 
if S is a subring of Z, then 1 € S$. This implies that finite sums of 1 are in S$. Since 
the negatives of integers in S are also in S, and hence 0 € S, we conclude that 
S=Z. 

Example 8.84. The ring (Z,+,-) of integers is a subring of the ring of rational 
numbers (Q,+,-), which in turn is a subring of the ring of real numbers (R, +,-). 


Non-Example 8.85. Consider the ring (Q,+,-), and define the equivalence rela- 
tion~ on Q bya~ b if and only ifa—b € Z. As a set, we can consider the quotient 
set Q/ ~ as QN[0,1). However, addition and multiplication on Q do not descend to 
the quotient Q/ ~. Note that [1] = [0], so we would have [0] = [1] - [r] 4 [1-r] = [r] 
unless r € Z. This shows in general that one cannot take the quotient of a ring by 
a subring in a meaningful way. 


To be able to take quotients of a ring by a subset, we have the following notion. 


Definition 8.86. A (two-sided) ideal] / of a ring (R,+,-) is a subset satisfying the 
following: 
(1) (J, +) is a subgroup of (R, +). 
(2) For every i€ I andr € R, we have both r-i€ J andi-re TI. 
Observe that if an ideal J of a ring R contains the multiplicative identity e, 


then J = R. For example, Z is not an ideal of the ring (Q,+,-). In fact, the only 
ideals of (Q,+,-) are {0} and itself. 
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We leave it to the reader to imagine what the definitions of left ideal and 
right ideal are (you may check your guess at the “Ideal (ring theory)” Wikipedia 
link for example). Of course, if the ring is commutative, then left ideals and right 
ideals are each two-sided ideals. 


Example 8.87. For the ring of integers (Z,+,-), mZ is an ideal for each positive 
integer m. 


We say that an integral domain is a |principal ideal domain) if each ideal is 
generated by a single element. That is, for each ideal J, there exists an element a 
such that I = (a). 


Example 8.88. The ring of integers (Z,+,-) is a principal ideal domain. 


8.6.1. The quotient ring. Let (R,+,-) be a ring and let I be an ideal of R. 
The quotient set R/I := R/ ~ has as elements the equivalence classes 


(8.148) fr) =rtil={r+ilie I. 


We leave it as an exercise to the reader to show that if we define addition and 
multiplication on R/I by 


(8.149) [r]+[s]:=[r+s] and [r]-[s]:=[r-sl, 


respectively, then (R/I,+,-) is a ring. This ring is called the|quotient ring 


8.6.2. Ideals in the ring of integers. We show that Example 8.87] gives all of 
the ideals in the ring of integers. Let I be an ideal of the ring of integers (Z,+,-), 
which is not trivial; ie., J ~ {O}. 

Exercise 8.45. Prove that IN Zt #90. 
Let S :=IMZ*. By the well-ordering principle, m := min S' exists. 


Exercise 8.46. Prove that l= mZ. Hint: Firstly, show that mZ C I. 


Definition 8.89. We say that an ideal J in a ring (R,+,-) is a prime ideal if 
a,b€ Randa-be€TI implies thatae€ J orbe I. 


Exercise 8.47. Prove that if an integer m > 2 is composite, then the ideal mZ is 
not a prime ideal. 


Exercise 8.48. Let p be a prime number. Prove that the ideal pZ is a prime ideal. 


The quotient ring Z/mZ is denoted by (Zm,+,-). Note that the ring (Ze, +, -) 
has zero divisors in the sense that [2] - [3] = [0]. On the other hand, if p is prime, 
then (Z,,+,-) does not have zero divisors. Exercise: Prove this! 
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8.7. Fields 
In algebra, fields do not mean (sadly!) Fields of Dreams or even force fields for that 


matter! 

8.7.1. Familiar examples. In §8.5] we discussed groups, which have a single 
binary operation, with (Z,+) as an infinite prototype and (Z,,,+) as a finite pro- 
totype. 

Earlier, in 48.5.1] we briefly discussed the notion of a commutative ring, which 
has two binary operations, one called addition and one called multiplication. Here, 
the ring of integers (Z, +, -) is a prototype (there are more general rings of integers). 
Other prototypical rings are the ring of rational numbers Q and the ring of real 
numbers R, both with the same addition and multiplication as for the integers AR 


For the ring of integers, one could say it has the following defect: Not all 
integers have multiplicative inverses inside the set of integers. In fact, the only 
integers with multiplicative inverses are 1 and —1. On the other hand, any non- 
zero rational number has a multiplicative inverse in Q. Similarly, any non-zero real 
number has a multiplicative inverse in R. 


8.7.2. Definition of a field. If we axiomatize these properties, we obtain the 
notion of a field 


Definition 8.90. A field is a set K with binary operations + and - on K satisfying 
the following properties: 


(1) (Associativity) For any x,y,z € K, 


(8.150) (e+y)+tz2=2+(y+2), 

(8.151) (a-y)-z=a-(y-2z). 
(2) (Commutativity) For any x,y € K, 

(8.152) cty=yta, 

(8.153) Vey=y-k. 


(3) (Identity element) There exist elements 0 € K and e € K, with 0 F e, 
satisfying: For all « € K, 


(8.154) e+0=04+2=2, 
(8.155) ve=e-L=L4. 


(4) (Existence of an inverse) 0+ 0 = 0, and for each « € K — {0}, there exist 
elements —a and x~! satisfying 


(8.156) x+(—2) =(-x2)+2=0, 

(8.157) gg + aa tee Se, 
(5) (Distributivity) For any x,y,z € K, 

(8.158) x-(ytz)=x-yton-e. 


?More precisely, the addition and multiplication operations extend from the case of the integers. 
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We note that since addition and multiplication are commutative, in the “iden- 
tity element” and “existence of an inverse” axioms it was redundant to switch the 
order of the additions and multiplications. 


In other words, a field is a commutative ring with the properties that 0 4 e 
and each non-zero element has a multiplicative inverse. 


8.7.3. The fields Q, R, and C. As we have mentioned in the previous sub- 
section, both Q and R are fields. We leave it to the reader to spend a moment to 
check this. 


8.7.3.1. The complex numbers C. One problem with the field of real numbers 
is that not all polynomial equations have solutions! Take for example the equation 


(8.159) a? -+1=0. 


This quadratic polynomial equation has no solutions in the field of real numbers. 
This can be rectified by defining the complex numbers. 


We can define the field of (C,+,-) by introducing an inde- 


terminate i, called the imaginary unit, and making a definition using elementary 
constructions in formal algebra. We sidestep this method by defining the field of 


complex numbers in the following way. 
As aset, we let C be the cartesian product R? = R x R (see §5.2). We 
define |\complex addition + to be the usual vector addition: 


(8.160) (11, y1) + (#2, yo) = (1 + 22, y1 + ya). 


With the motivation of (0,1) corresponding to i satisfying i? = —1, we define 


complex multiplication - by 
(8.161) (@1, 41) + (2, Y2) = (@1%2 — Yry2, Liy2 + y1L2). 
As a special case, we see that 


(8.162) (a) 0.1)= (0), 


We leave it as an exercise for the reader to show that (C,+,-) is a field. This 
is an easy exercise in the sense that each field axiom in Definition |8.90]is straight- 
forward to check, but you do have to check a handful of axioms. 


A natural way to think of the set of real numbers as a subset of the set of 
complex numbers is to define the injection 


¢:R-C 
by 
(8.163) (x) = (2,0). 


Via this injection ¢, we can think of a real number as a special case of a complex 
number. Namely, given x € R, we identify it with ¢(a) = (x,0) € C and still call 
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the image of x under ¢ to be x. So x € C with this notation. In this sense, 
we consider R as a subset of C 


even though it is really ¢(R) C C. 
We also use the notation 


(8.164) i := (0,1). 
With this notation, we write x + yi for (2, y). Complex multiplication becomes 
(8.165) (x1 + yrt) - (w2 + yot) = (102 — yry2) + (w1y2 + 182) 4. 


We abbreviate for z,y € R 


’ 


(8.166) x+0i =: 2, O+ yi = yi. 

We also abbreviate 0 + li = 11 =: 7. The origin is the element 0 = 0+ 0i. 
Let z:=a2+yieC. We define 

(8.167) 2i=z-z=(axt yi): (et yi) = (2? — y*) + 2ayi. 

In particular, 

(8.168) f=; 

that is, z =7 is a solution to 

(8.169) 2+1=0. 


We leave it to the reader to check that z = —i is another solution to (8.169). 
We define z° := 1 and for k € Zt, 


(8.170) Z* = ge g---z, 
Ce — 


k times 


where we have taken advantage of the fact that multiplication is associative to omit 
the brackets. To be more rigorous, we should define z* by induction: 


(ce? t= 4, 
QO) eee"! ior REZ, 
The complex conjugate of a complex number z = x + yi is defined by 
(8.171) Z:= ax — yl. 


The modulus] (or absolute value) of a complex number z = x + yi is defined 
by 


(8.172) |z| = Va? +4. 


This is equal to the distance from (a, y) to (0,0) in the Euclidean plane R?. Observe 
that 


(8.173) le =e ez. 
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The principal value of the argument 
Arg(z) 


of a non-zero complex number z = x + yi is equal to the unique angle (in radians) 
from the positive x-axis to the vector (a, y) satisfying 


Arg(z) € (—7, 7]. 
Example 8.91. (1) Arg(—4) =7. 
(2) Arg (1,—V3) = =a 
0) ae (-aood) = 
The argument] 
arg(z) 


of a non-zero complex number z = x + yt is the set of all angles from the positive 


x-axis to the vector (a, y). So arg(z) is almulti-valued function 
Example 8.92. (1) arg(—4) ={7+2mn: ne Z}. 
(2) arg (1,-V3) = (4 + 2a ne Zh. 


(3) arg (-4. 5) = {22 42nn: ne Z}. 
For any complex number z, we have 
(8.174) z = |z| (cos(arg(z)) + sin(arg(z))2). 


The modulus and argument have the following beautiful properties of Lemma 


[8.93] (we leave it to you to work out or the proofs). 


2122 


Zl 


Figure 8.7.1. Complex multiplication satisfies |z1z2| = |z1| |z2| and arg(z1z2) 
= arg(z1) + arg(z2). 
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Lemma 8.93. Let z1 and zz be complex numbers. Then 
(1) |2122] = |z1| |zel- 
(2) arg(2122) = arg(z1) + arg(z2). 
(3) (Triangle inequality) |z1 + z2| < |z1| + |22I. 


The triangle inequality is actually a special case of the triangle inequality for 
Euclidean n-space (n = 2); see Corollary [5.25] 


Example 8.94. Let 2; =1+ V/3i and z2 = —3 + 3%. We calculate that 
2122 = (1+ V3i)(—3 + 3i) = —3(1 + V3) +. 3(1 — V3)i. 


Observe that 
Jal=2, |z2)=3, |zz2| =6, 
which agrees with Lemma[8.93(1). Further observe that 


T 37 
Arg(z1) = 3° Arg(zq) = rs 
Thus, by Lemma|[8.93/2), we have 
na 30 11 
Arg(z122) = 3 + > 27 = a5 


where we needed to subtract the 27 to obtain an answer in the interval (—7, 7]. 
Thus, by (8.174), we have 


cos ( 11 )--4 sn ( 11 j= 


T T 
12 2 12 2 


8.7.3.2. Field homomorphisms. Roughly speaking, a homomorphism] from one 


algebraic structure to another of the same type (whether it be groups, rings, fields, 
etc.) is a function which preserves the algebraic structures. 


Let (K,+,-) and (K’,+’,-’) be fields and denote their additive and multiplica- 


o] 


tive identities by 0, 0’ and e, e’, respectively. A field homomorphism|from K to K’ 


is a function ¢: K — K’ satisfying the following properties: 
(1) (Preserves addition) ¢(# + y) = ¢(x) +’ d(y). 
(2) (Preserves multiplication) ¢(#-y) = o(x) -’ (y). 
(3) (Preserves multiplicative identities) ¢(e) = e’. 


We leave it as an exercise for you to show that these properties imply the 
following: 


(1) (Preserves additive identities) 4(0) = 0’. 
(2) (Preserves additive inverses) 


(8.175) o(—«) = —¢(2). 
(3) (Preserves multiplicative inverses) If « 4 0, then ¢(a) 4 0! and 
(8.176) g(a") = (e(a))~". 


Here, — and ~! denote the additive and multiplicative inverses, respectively, 
in both fields K and Kk’. 
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Example 8.95. The injective function @ : R — C defined by (8.163) is a field 
homomorphism. 


If a field homomorphism ¢: K — K’ is a bijection, then we say that ¢ is a 
field isomorphism. In this case we say that the fields K and K’ are isomorphic. 

It turns out that for a field isomorphism ¢, its inverse ¢~! is also a homomor- 
phism and hence an isomorphism. 

8.7.3.3. Subfields. Let L be a non-empty subset of a field (K,+,-). We say 
that (Z,+,-) is a subfield of (K,+,-) if Z is closed under both addition and multi- 
plication and if (L,+,-) is itself a field. That is (we leave it as an exercise to show 
that the following list of conditions is equivalent): 


(1) (Closed under addition and multiplication) If 7,y € L, then x+y € L and 
e-yel. 

(2) (Additive inverse exists) If « € L, then —x € L. Since L is non-empty, this 
implies 0 € L. 

(3) e€ L. 

(4) (Multiplicative inverse exists) If c € L — {0}, then a7! € L. 


Example 8.96. The subset 4(R) of C, where ¢ is defined by (8.163), is a subfield. 
Since we like to identify R with ¢(R), 


we can consider R as a subfield of C. 


8.7.3.4. Polynomials. The following result is so important that we may call it 


the Big Fundamental 
8.7.3.5. The Fundamental Theorem of Algebra. The following is the|Fundamental 


Theorem of Algebra 


Theorem 8.97. Let ao,a1,...,an be complex numbers, where n € ZT and an # 0. 
Then the complex polynomial defined by 


(8.177) ie > Ope” =0n8" + agiae? oe ape tag forse Cc 
k=0 
has a zero; t.e., there exists z € C such that f(z) =0. 


Proof. Using that f is a polynomial, we will show: 
Step 1: The function 
ze If (z)| 
achieves its minimum at some point zo € C. That is, there is a zo € C where f(z) 
is closest to the origin. Of course, later we want to show that f(zo) is the origin. 


The infimum of z + |f(z)| exists since it is a non-negative function and hence 
bounded from below by 0. The point is to show that the infimum is attained at 
some point; i.e., the minimum exists (the minimum of a function might not exist 
in general). 

Slightly simplifying assumption: Note that since an # 0, we have 


f(2)| _ lf) ei: | a oe 
oa leas and inf ea “Te [eer 


(8.178) 
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Thus, by dividing f (z) by a,, we may assume without loss of generality that 


An = 1. 


Recall, from Lemma|[8.931), that |zw| = |z||w|. In particular, | — z| = |z|, and 
|2"| = |z|" for n € Z* (the fastidious reader may prove this last fact by induction). 
By the triangle inequality (Lemma [8.93{3) and, for us fastidious ones, induction), 
we have 


(8.179) jzy tee+ + 2n| < Jer] +--+ + len. 
Exercise 8.49. Prove that for every 21, 22,.-.,2n € C, 

(8.180) \zy +--+ + 2n| > |2n| — 21] — +++ — |2n-il. 
Aint: 

(8.181) (21 ++°++ Zn) + (—21) +++ + (—2n-1) = Zn. 


Denote r := |z|, which is equal to the distance from z to the origin 0. The idea 
is that size-wise the leading order term of f, i.e., 2”, should dominate the rest of 
the terms when r is large. Since 


n—1 
f(z) =2"+ S- Anz", 
k=0 


using the triangle inequality (see Exercise |8.49) we may estimate 


n-1 


If > l2e"1- So faxz*| 


k=0 


n-1 
=r" S- |az| r* 
k=0 
n—1 
=r” ( — s lax| a 5 
k=0 


Since each power k — n of r in the sum on the right-hand side of the last line is 
negative, we have 


R=1 
: k-n _ 
im, ‘> la,|r°-” =0. 
k=0 
This implies the key fact that 
(8.182) lim |f (z)| = co. 


Z—> CO 


Recall the following |extreme-value theorem| 


Theorem 8.98. Let f : X > R be a continuous function on a closed and bounded 
subset X of R". The f attains both its minimum and its maximum. 


With this, we can proceed to show that the minimum of | f| is attained. Without 
loss of generality, we may assume that f(0) 4 0 for otherwise we are done. 
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Define the constant M := |f(0)| > 0. Since lim,_,.. |f (z)| = oo, there exists a 
positive real number R such that: 


(8.183) If || >R, then |f (2)| > M =|). 
Consider the closed ball 
B:={z€C: |z|< R}. 


Since the set B is closed and bounded, by Theorem [8.98] the restriction f|, of f 
to B achieves its minimum, since f is a continuous function. That is, there exists 
zo € C such that 


(8.184) Lf (20)| = int If (2). 


We now show that zo is a minimum point of |f| on all of C. Using (8.183), we 
see that 


(8.185) int | (2) > M =| F(0)| > int [fF I=L (o)I, 


with the second inequality because 0 € B. Since C = BU (C — B), by (8.184) and 
(8.185) we conclude that 


If (2o)| = inf |f (2). 
This completes Step 1. 


Im(f) 


f (Zo) 


Figure 8.7.2. The land-dwelling function f sends inputs to the grassy area 
and avoids the lake in the middle. Here, f(zo), who has just learned to swim, 
is the most adventurous output (the origin 0 is the deepest part of the lake). 


The theorem follows immediately from: 


Step 2: If there does not exist a zero of f(z), then we have a contradiction. 


Suppose, for a contradiction, that there does not exist a zero of f(z). Then 
| f (zo)| > 0. So we may adjust the polynomial f (z) by defining 


_ f(@+20) 
wey f (20) 
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We have g (0) = 1 and 


(8.186) lg(z)| >1=|g(0)| forall zEC; 


this is true because |f (z + zo)| > |f (zo)| for all z. 


Since g is a degree-n polynomial, we have 
n 
g(z) = 52 bz = bpz” + bp 12" | +++ + biz + bo, 
k=0 


where b,, #0. Note that bp = g (0) =1. 


Let j be the smallest positive integer such that b; #0. Then bj =--- = bj-1 = 
0, and we have 


g(z) =bp2™ +--+ +527 +1. 


‘ —b; : F by os 
Since Test is a unit vector, we may write it as 
J 
= e159) 


where 7 is the imaginary unit complex number and @ is a real number. Why we do 
this will be clear in a moment. That is, 


(8.187) be) = — |d,|. 


Figure 8.7.3. By a rotation and stretch, we can change the polynomial f to 
the polynomial g, where g(z) = 1+ bj;27 +---bnz”™ and bj # 0. When z is 
small, g(z) © 1+ b;24. For suitable z ~ 0, g(z) has jumped into the lake (see 
(8.188)), a contradiction. 


Keep in mind that we are aiming for a contradiction. Consider points of the 
form z = re’®, where r > 0. We have 


g (re’®) = by rel) 4... 4 bret) +1. 


8.7. Fields 409 


So, assuming that r > 0 is small enough so that r < lb;| 1”, 


we have by (8.187) that 


ie., —|bj|r7 +1>0, 


lg (re“*)| ns S- burkei) + Jojriet® +1 
k=j+1 


So [bel r® — [byl 7? +1 


k=j+ 


n 


D> [Palr*9 — bl | 09 +4, 


k=j+l 


where |b;| > 0. Since 


n 
li |p*-i — |p.| | = — Id, 
a + |be| 7 [25 |bj| < 0 
k=j+1 
and since the function r++ S7p_,.; |b| r*~4 — |b;| is continuous, we have for r > 0 
sufficiently small and @ chosen as above that 
(8.188) |g (ret?) | <1. 


This contradicts |g (z)| > 1 for all z € C from (8.186). The Fundamental Theorem 
of Algebra is proven! 


8.7.4. Finite fields. An interesting topic is that of |factoring polynomials over 
finite fields, We do not discuss this here. 


8.7.5. Algebraic structures that are not quite fields. Of special mention are 


the quaternions|} They form a non-commutative |associative division algebra! Also 


interesting are the octonions, which form a non-associative non-commutative divi- 
sion algebra. 


8.7.5.1. The quaternions. 


Example 8.99. Although we will not go into detail about the quaternions H, their 
elements can be represented by expressions of the form 


(8.189) xt+yi+zj +wk, 

where x, y,z,w € R and i,j,k satisfy 

(8.190) P=aPar=-1 

and 

(8.191) i-j=—j-i=k, j-k=—-k-j=i, k-i=—i-k=j. 


(There are more efficient ways to describe quaternion multiplication.) In any case, 
addition + corresponds to the usual vector addition of (x, y, z, w) and multiplication 
- uses the rules above. 


However, the quaternions (H,+,-) are not a field since multiplication is not 
commutative. 
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8.8*. Quadratic residues and the law of quadratic reciprocity 


In this section we work over the (finite) field Z,, where p is a prime. In this case 


we say that we are working over characteristic p, When we work over the field 


of complex numbers C, we say that we are working over characteristic 0. More 
generally, we have the following. 


Definition 8.100. Given a field K with additive and multiplicative identities 0 
and e, respectively, we say that it has characteristic p € Zt if p is the smallest 
positive integer such that 
(8.192) Sree =o. 

p summands 
If no such p exists, then we say that the field K has characteristic 0. 


Example 8.101. For all primes p, the field Z, has characteristic p. The fields Q, 
R, and C all have characteristic 0. 


We are interested in understanding quadratic equations in the field Zp, for 
example, zeroes of quadratic polynomial equations. 


8.8.1. Lagrange’s Theorem on zeroes of polynomials over Z,. 
[Theorem] in number theory says the following. (We can think of this as saying that 


a weaker form of part of the Fundamental Theorem of Algebra (Theorem [8.97 
holds in Z,.) 
Theorem 8.102. Let 
(8.193) A(x) = ane" + apie” 1 +-+++ a2" +12 +09 
be a polynomial with integer coefficients aj,a1,...,Qn, wheren € Zt. Let p be a 
prime. Then either 
(1) p divides a; for all 1 <i< nil or 
(2) the congruence equation 
(8.194) A(x) = 0 mod p 


has at most n incongruent solutions. 


In other words, given a degree-n polynomial over Z,, where p is a prime, either 
all the coefficients are zero modulo p (in this case we say that the polynomial is 
trivial) or there are at most n distinct roots to the polynomial modulo p. 
Non-Example 8.103. For m = 9, the quadratic equation x? = 0 mod m has the 
three incongruent solutions 0,3,6. More generally, the quadratic equation x? = 
0 mod p?, where p is a prime, has the p incongruent solutions 0, p, 2p, ...,(p—1)p. 


Proof of Theorem [8.102| Without loss of generality, we may assume that a, 4 
0 mod p in the statement of the theorem. For, otherwise, if a, = 0 mod p, then the 
term a,x” = 0 mod p for every x € Z. 


We prove the theorem by induction on n € Z?. 


3In this case, A(z) = 0 mod p for all x € Z. 
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Base case: n = 1. Let 
A(x) =a," +a0, where a; #0 mod p. 

Suppose that x; and x2 solve the congruence equation (8.194). Then 
(8.195) a1,21 + a9 =0=a,22 + ap mod p. 
This implies that 
(8.196) Q,21 = a, £2 mod p. 
Since a; # 0 mod p and p is a prime, we have gcd(a,,p) = 1, and so by Corollary 
[6.21] we obtain 
(8.197) 21 = 2 mod p. 
Thus A has a unique root modulo p (by Lemma|6.38) there exists a root). 


Inductive step: Suppose the statement of the theorem is true for n = k, 
where k € Zt. 


Let 
(8.198) A(x) = apie" t +aga*+---+aon7+a,2+a9, where az41 #0 mod p. 


Without loss of generality, we may assume that A has a root 29 modulo p. We may 


now use the/Polynomial Division Theorem to obtain that there exists a degree- 


k polynomial Q(a) with integer coefficients and a degree-0 polynomial R(x) =c EZ 
(a constant) such that 


(8.199) A(x) = (a — xo) Q(x) +e. 
Since A(x) = 0 mod p, we have c = 0 mod p, and hence 
(8.200) A(x) = (a — 20) Q(x) mod p. 


Since p is a prime, by (8.200) any root modulo p of A(x) is equal to xo or a root 
modulo p of Q(x). By the inductive hypothesis, there are at most k incongruent 
roots modulo p of Q(#). Thus, A(x) has at most k + 1 incongruent roots modulo 
p. This proves the inductive step, and hence by induction we are done. 


Figure 8.8.1. Carl Friedrich Gauss, 1777-1855; portrait by Christian Al- 
brecht Jensen (1840). Wikimedia Commons, Public Domain. 
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8.8.2. Simplifying quadratic equations in Z,. Let p be an odd prime. Con- 
sider the general quadratic congruence equation in characteristic p: 


(8.201) ax” + ba +c=0 mod p, 


where a,b,c € Z and a #0 mod p. Since ged(a, p) = 1, there exist integers b’ and 
c’ such that b = ab! mod p and c = ac’ mod p. Using gcd(a,p) = 1 again, we see 
that (8.201) is equivalent to 


(8.202) x? +b'2+c =0modp. 


Since p is an odd prime, we have gcd(2,p) = 1, so that there exists an integer b”’ 
such that b’ = 2b’ mod p. So we may rewrite the congruence equation (8.201) as 


(8.203) (a +b”)? — (b”)? + =0 mod p. 


Thus, if we let 2’ := 2+" and q := (b")? — c’, then we obtain the equivalent and 
more simpler looking equation 


(8.204) (x')* = q mod p. 
For this reason, we only look at quadratic equations of this last form. 


Example 8.104. Consider the quadratic congruence equation 


(8.205) x? =amod 17 

for all possible choices of a. We list all solutions modulo 17 for 0 < a < 17: 
1? = 16? = 1, Pe 1s ad a? = 14? =D, iP a 1? = 16, 
Si =, eS =o, = 10'S 15, 8? = 9? = 13. 


On the other hand, for a equal to 3, 5, 6, 7, 10, 11, 12, 14, there are no solutions. 
Observe that for each choice of 0 < a < 17, there are either 0 or 2 solutions to 
x? =amod 17 with 0 < x < 17. 


8.8.3. Quadratic residues. Let m be a positive integer. We say that an integer 
q is a\quadratic residue modulo m if there exists an integer x such that 
(8.206) x? =qmodm. 
In other words, an integer qg is a quadratic residue modulo m if in the “Z,, world” 
q is a perfect square! 


Let us call g a quadratic residue remainder modulo m if q is a quadratic 
residue modulo m and 0 < q < m. Since 0? = 0 mod m is a rather trivial case, 
we will assume that q > 0. Note also that 12 = 1 mod m, so that 1 is a quadratic 
residue remainder modulo m for all m > 2. So we have the following: 


0 and 1 are quadratic residue remainders for every modulus m. 
So, a quadratic residue remainder modulo m is a perfect square in the remainder 
modulo m world Ry. 


We say that q is a non-trivial quadratic residue modulo m if m does not 
divide q (ie., g #0 mod m). 


By the example at the end of the preceding subsection, the quadratic residues 
modulo 17 that are strictly between 0 and 17 are 1, 2, 4, 8, 9, 13, 15, and 16. 
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If you like to think ahead, here is a question for you: 


Let p be a prime. When is —1 a quadratic residue? 


After all, —1 is the “next simplest” integer after 0 and 1. 


Figure [8.8.2] is a table of the non-trivial quadratic residue remainders modulo 


m for the first nine integers m > 2. 


(8.207) 


quadratic residues 

1 
1 
1 

1,4 

1,3,4 

1,2,4 

1,4 

1,4,7 

1,4,5,6,9 


a 
SeOaNankwnils 


Figure 8.8.2. The non-trivial quadratic residue remainders modulo m for 
2<m< 10. 


In particular, the quadratic residues modulo 10 are the integers 1, 4,5,6,9. This 


means that the last digit of any perfect square is always one of these five integers. 


Example 8.105. We can use quadratic residues to rule out integers from being 
perfect squares. For example: 


SNe Ne Ne 


1) The integers 10’ + 1 and 10’ + 3 are not perfect squares. More generally, any 


integer of the form 3k + 2 is not a perfect square. 


The integers 10’ + 2 and 10’ +3 are not perfect squares. More generally, any 
integer of the form 4k + 2 or 4k + 3 is not a perfect square. 


The integers 10’ + 7 and 10’ +8 are not perfect squares. More generally, any 
integer of the form 5k + 2 or 5k +3 is not a perfect square. 


The integers 10’ + 10 and 10° + 13 are not perfect squares. More generally, 
any integer of the form 6k + 2 or 6k + 5 is not a perfect square. 


On the other hand, 10° + 4569 = 3163? is a perfect square. We see that it is 
equal to 1+4+5+6+9 = 25 =1 mod 3, 1 mod 4, 4 mod 5, 4567 = 1 mod 6. 
With the help of a calculator, we see that 10° + 4569 = 3163? is equal to 
7- 1429224 + 1, so that it is 1 mod 7. It is equal to 8 - 1250571 + 1, so that 
it is 1 mod 8. It is equal to 9- 1111618 + 7, so that it is 7 mod 9. Clearly it 
is 9mod 10. Of course, all of these congruence formulas are consistent with 
10” + 4569 being a perfect square. 
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8.8.4. The Legendre symbol. We can keep track of what are and what are 


not quadratic residues as follows. The|Legendre symbol is defined by the following: 


For a positive integer m and an integer q, 


r 1 if gis a non-trivial quadratic residue modulo m, 
(8.208) (+) = ¢ —1 if q is not a quadratic residue modulo m, 
0 if gis a multiple of m. 


Example 8.106. (1) For all m > 2, since 17 = 1 < m, we have 


(8.209) i (=) 


(2) Since 2? = 4, 3? —-6 = 3, and 3? — 7 = 2, we have 


mm (==) -@)-0) 


(3) Since the non-zero quadratic residue remainder modulo 3 is 1 and modulo 4 
is 1 and since the non-zero quadratic residue remainders modulo 6 are 1, 3, 4 
and modulo 7 are 1,2,4, we have 


ean) -1=(5)=(2)= (2) = (5) = (a) =(3) = (2) =), 


(4) Since the non-zero quadratic residue remainders modulo 10 are 1, 4,5,6,9, we 
have 


(8.212) ( r) -{ 1 ifg=1,4,5,6,9, 


10 -1 ifq=2,3,7,8. 


Recall that Fermat’s Little Theorem [6.63] says that if p is a prime and if a isa 
positive integer which is not a multiple of p, then 


(8.213) a?-! = 1 mod p. 
Now assume in addition that p 4 2, so that p is odd. Then po is an integer and 
we may rewrite this equation as the “quadratic” equation 


(8.214) (0° = 1) (a"> a 1) 


Since p is a prime, we thus have 


Pp 


(a*?)? —1=0mod p. 


p— 


(8.215) a’? —1=0modp or a’? +1=0; 


that is, if a € Z* is not a multiple of an odd prime p, then 


p-1 


(8.216) a = =+1 mod p. 


For example, if a € Zt is not a multiple of the odd prime 7, then a? = 
+1 mod 7. Table[8.8.1]shows the actual values (+1 or —1). 
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Table 8.8.1. Euler’s criterion (see Theorem [8.109] below) for p = 7: a? = 
1 mod 7 (—1 mod 7) if and only if a is (is not) a quadratic residue. 


a| a> | a> mod7|0<b <7, where b? =a 
1 1 1 1 

2 8 i 3 

3. |) 2¢ —1 none 

4] 64 1 2 

5 | 125 —1 none 

6 | 216 —1 none 


8.8.5. Number of solutions to a quadratic equation modulo a prime. We 
are particularly interested in the case of prime moduli. 


Lemma 8.107. Let p be an odd prime, and let a be an integer that is not a multiple 
of p. Then there are either 0 or 2 solutions to the quadratic congruence equation 


(8.217) a? =a mod p. 
In other words, by definition (8.208) there are 
(8.218) (<) +1 


Pp 
solutions to (8.217). 


Proof. Suppose that 2; and x2 are both solutions to (8.217). Then we compute 
that 

(8.219) (xy — t2)(a1 + 2) = 2? — 22 =0 mod p. 

Thus, since p is a prime, either 


(8.220) %j=22modp or x21 =—xL2 mod p. 


Now, if there are no solutions to (8.217), then we are done. So, we suppose that 
there exists a solution x; to (8.217). By (8.220), there are at most two solutions. 
On the other hand, p — x, is also a solution, and since p is odd, this solution is 
distinct from 2. Therefore, there are exactly two distinct solutions modulo p. 


Non-Example 8.108. On the other hand, for the modulus m = 6 (which is not 
a prime), the quadratic congruence equation x? = 3 mod 6 has the unique solution 


x =3mod 6. 


8.8.6. Euler’s criterion. [Euler’s criterion is a criterion for determining whether 
it is +1 or —1 in formula (216): a°= = +1 mod p. Table is the special case 
p=7T of the following. 


Theorem 8.109. Let p > 2 be a prime, and let a be an integer coprime to p. Then 


(8.221a) at =1 modp ifa is a quadratic residue, 


p-1 


(8.221b) a2? =-I1modp ifa is not a quadratic residue. 
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Proof. (Slow version, where we intersperse examples) Let a be an integer coprime 
to p, and let 


(8.222) Sa := {cE N,-1:¢? #a mod p}, 


where N,-1 = Rp — {0} = 1,2,3,...,p—1. Observe that if a is not a quadratic 
residue, then S, = Ny-1. 

Step 1 = Claim 1. The cardinality of Sq is even. 

For example, by Lemma [6.44] we have that 


(8.223) S; =N,-1— {1,p— 1} = {2,3,...,p— 2}, 


which has cardinality p — 3, which is even since p is an odd prime. 
Another example is S, for p = 7. In this case, the sets in (8.222) are 


S, = {2,4,3,5}, So ={1,2,5,6}, S3 = {1,3,2,5,4, 6}, 
Sa = {1,4,3,6}, Ss ={1,5,2,6,3,4}, Se ={1,6,2,3,4,5}, 


which all have even cardinality. 


Key idea to prove Step 1. Ifc € 5, satisfies c? 4 a mod p, then there exists 
a unique x € S, such that 


(8.224) czr=amodp and «rc. 


Proof of the key idea. Since gced(c,p) = 1, by Lemma [6.38] there exists an integer 
x such that cr = a mod p. By Theorem [6.41] the solution x is unique modulo p. 
Since c? # a mod p, we have x # c. 

Now suppose for a contradiction that x2 = amodp. Then x? = cx mod p. 
Since gcd(c,p) = 1, this implies that z = c mod p (by Corollary [6.21), so that 
x =c since both integers are in N,_1, a contradiction. Thus x? 4 a mod p, which 
implies that « € S,. We have proved the key idea. 


Now, the key idea implies that the elements of S, are partitioned into (un- 
ordered) pairs {c,d} where cd = amod p and c # d. (The reason why we have a 
partition is that if cd) = a = cdz mod p, then d, = d, mod p.) This proves Step 1 
that |S,| is even. 


Step 2. Writing (p —1)! modulo p in a clever way. 


Clearly, we have the disjoint union N,-; = 5, U (Np-1 — S.). Thus, p— 1 
factorial may be written as 


(8.225) -1'= [= TPs. II « 


j€Sa  kENp-1—-Sa 


Consider the first product on the right-hand side. We have that | jes, J 18 equal to 


|Sa| 


> pairs whose products are congruent to a modulo p, and hence 


the product of 


(8.226) II j= a 2 mod Dp. 


JESa 
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Step 3. Deducing the theorem. 


Case 1: a is not a quadratic residue. In this case, S, = Np-1, so that |S,| = 
p—1. Hence 


(8.227) (p—!= J] g=0°* =Sa°™ mod p. 


Case 2: a is a quadratic residue. In this case, there exists r € N,-1 such that 
r? =amod p. This also implies that (p — r)? = a mod p. Since p is odd, we have 
that p—r #r mod p. 
We claim that r and p—r are the only elements of N,-1 whose squares are 
congruent to a modulo p. Therefore |S,| = p — 3. 


Indeed, suppose that s € N,_1 satisfies s? =a mod p. Then 
(8.228) (s+r)(s—r)=s? —r? =a—a=0modp. 


Since p is a prime, this implies that 


(8.229) str=0modp. 


Since s € N,_1, we conclude that s =r or s=p-—r. 


Therefore, when a is a quadratic residue, we have that 


(8.230) II j= a? =a"® mod Dp. 
GESa 
We also have that 


(8.231) II k=r(-r) r? a mod p. 
kEN,-1—Sa 


Therefore, 


(8.232) (p—1)!= II je Il k= a®=(—a) —a’= mod D. 


j€Sq  kENp—1—Saa 


In particular, since 1 is a quadratic residue, we obtain 
(8.233) (p — 1)! = —1 mod p. 
By the way, this reproved4 the < direction of Wilson’s Theorem [6.69] 
With this knowledge, we revisit and to conclude that: 
(1) If a is not a quadratic residue, then (8.227) implies that 
(8.234) -l=a'™ modp. 
(2) If a is a quadratic residue, then (8.232) implies that 


(8.235) 1=a'? modp. 


We have completed the proof of the theorem. 


4By “reprove” we mean to prove again, and not to reprimand or censure. © 
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By the definition of the Legendre symbol (8.208), an immediate consequence 
of the theorem is: 


Corollary 8.110. If p is an odd prime and if a is an integer, then 


(8.236) (2) =a’? modp. 


8.8.7. Statement of the law of quadratic reciprocity. An important fact 
about quadratic residues is the following. 


Theorem 8.111 (Law of quadratic reciprocity). [fp and q are distinct odd primes, 


then the Legendre symbol satisfies the identity 


(8.237) (2) (2) SC er 


q Pp 
We prove this fundamental theorem in the next subsection. 


Example 8.112. Taking p = 5 and q = 7, we have 


eee) (2) (2) =(-1)-(-I)=1=(-1)"*—, 


where we used (Z) = (2). 


Figure [8.8.3] is a table of the prime factorizations of integers of the form n? — 5 
for 3<n< 10. 


n? —5 | prime factorization 


nm 
3 4 2? 
4 11 2? 
5} 20 27.5 
(8.239) 6} 31 31 
7| 44 27.11 
8] 59 59 
9| 76 27.19 
10} 95 5-19 


Figure 8.8.3. The prime factorizations of perfect square minus 5. 


Remarkably, the prime factors are always 2, 5, or a prime that ends in 1 or 9. 
Thus, primes ending in 3 or 7 cannot occur. In other words: 


If p is a prime dividing n? — 5, then p 4 3,7 mod 10. 


We leave it to the reader to conduct an easy search of the literature for a proof of 
this well-known fact. 
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8.8.8. A proof of the law of quadratic reciprocity. Recall from (8.121) that 
for all positive integers m, 


(8.240) Zr, = {[klm € Zm : gcd(k, m) = 1}. 
By Lemma[8.68] (Z*,,-) is an|abelian group} where - denotes congruence class mul- 


tiplication. By Definition [6.71] Euler’s totient function y(m) is equal to its order 
[Zin 
Let p and q be distinct odd primes. Then Z := Z, — {[0]p} is an abelian group 
(cf. Example [8.51). Note that |Z>| =p —1. Let 
—1 
PS >; and = Q:= —_, 


so that 2P = p—1 and 2Q = q-1. Since p and q are odd, we have that P and 
Q are integers. Recall that the law of quadratic reciprocity (8.237), which we will 
now prove, states that 


cn (2) (2) 


As defined in Exercise [8.32] we have the (abelian) product group Z> x Zj. By 
definition, 


(8.242) Ze, x Zt ={(lilp Ug) : 1S is 2P,1< 7 < 2Q}. 


Per definition, multiplication in Z}, x Zj is defined by 


([e’lps Ia) . ([ilp, Lila) = ([etlp, [7ale)- 
Since p and q are distinct primes, by Theorem [6.76] we have 
(8.243) |Zql = (P— 1)(a— 1) = |Z) |ZI- 


In fact, recalling the proof of Theorem[6.76} we observe that by the Chinese Remain- 
der Theorem [6.53) for any integers a,b, the system of linear congruence equations 


(8.244a) k=amodp, 
(8.244b) k =bmod q 


has a solution kg. Moreover, an integer k is a solution to (8.244) if and only if 
k = ko mod pq. Thus, we can define a function ® : Z) x Zi > Zp, by 


(8.245) ®((a]p, (bq) = [Alpa: 


where k is a solution to (8.244). Indeed, given [a], € Z} and [b|, € Z%, the Chinese 
Remainder Theorem implies that [A], € Zpq is well-defined. Moreover, suppose for 
a contradiction that gced(k, pg) > 1. Since p and q are primes, this implies that p|k 
or q|k, either of which contradicts (8.244), p Ja, and qb. Thus gcd(k, pq) = 1, so 
that [k]p, € Z5,. We have proved that ® is a well-defined function} 


5To wit, the unique solvability of an equation yields to a well-defined function. 
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We leave it as an exercise to show that ® is a group homomorphism (as defined 

in 985.13). We then observe that its inverse function is the homomorphism UV = 
~!: Zs, — Zs x Z* defined by 

(8.246) W([k] pq) = ([k]p, [K]q)- 
Therefore, ® : Zi, x Zi — Z), is a group isomorphism. 

For convenience, we denote with a bit of ‘Jabuse of notation’ 

(i, 9) == (lip: Lia) € Zp x Z}. 

Consider the {normal subgroup] U := {(1,1),(—1,—1)} of Zs x Zz and the corre- 
sponding [quotient group] (see 


(8.247) G = (Z; x Z5)/U, 
which is an abelian group. Note that |U| = 2 and |G| = 2PQ. We have 
(8.248) Ga={Ga)U +1516 2 Ri <7< Ch, 


expressed as distinct elements, where we used that the coset (i, 7)U is equal to the 
coset (p —i,qg—j)U (recall that [—-7], = [p— iJ») 4 In G, consider the product of 
its 2PQ elements, which is the coset 


2P Q 


(8.249) g:= | [Tea 


i=1 j=1 


Now, summing over 7 first and over 7 second, 


2P Q 2P Q 
(8.250) Hea =IL |? TL 
i=1 j=l i=l j=l 
2P 
= [1 @?.@) 
i=l 
= (Py Or). 
On the other hand, = i = —(q—7) mod q for 1 <i < Q, we see that 
Q 
ors ie [[ (- @-9) = ©1)9(2@)! mod gq, 
i=1 i=] 


since iba 1(¢— 7%) = 2Q- (2Q —-1)---(Q+1). From all of this we obtain that the 
product of the 2PQ elements of G is equal to the coset 


(8.251) g = ((2P)!%, (2Q)!? (-1)?2)U EG. 
Claim. 
(8.252) G = {([k]p, [Alg)U : 1<& < (pq— 1)/2, ged(k, pq) = 1}. 


By the claim, we have that 


(8.253) g=|[% HU = (11 k]] 7 U. 
k k 


k 


5 This allows us to sum j from 1 to Q instead of 2Q. 


8.8*. Quadratic residues and the law of quadratic reciprocity 421 


where the products in the display are over all integers k satisfying 1 < k < (pq—1)/2 
and gcd(k, pq) = 1. Now, each product above may be expressed as (dividing by a 


congruence class means multiplying by its inverse; see also Remark [8.113] below) 
(8.254) 
2P . 772P 2P 2P aq 
Il = [i-17- T=, 0p +4) - Ti p + @)-- Ti. ((@ — 1p + IT, (Qp + 4) 
be lq . 2q eee Pq 
2P)\@ 
= ( d mod p, 
q 


by cancelling the factors P! in the numerator and denominator (I (Qp+i)=P! 
mod P) and since for each 0 < 7 < Q—1, we have 


2P 
[ [Ge +1) = (2P)! mod p. 


i=1 


Similarly to (8.254), we have 


(2Q)!P 
(8.255) IT me mod q. 


Remark 8.113. For the first equality in (8.254), we used the fact that we have 
the set equality 
{ke Z:1<k< (pq—1)/2, ged(k, pq) = 1} 
=| U U tip+au U {ep+4} ] - {0,29,..., Pa}. 
0<j<Q 1<i<2P 1<i<P 


This uses the fact that the integers coprime to pq are those that are not multiples 
of p or gq. 


Recall that Euler’s criterion (Theorem |[8.109) says that 
(8.256) ¢ = (2) mod p and p?= (2) mod q. 


From (8.253), (8.254), (8.255), and this we obtain that (note that 1 and —1 are 


each their own inverses modulo p or q) 


(8.257) g= (ery? (2) ,(2Q)!P (2)) U. 


Finally, by comparing (8.251) and (8.257), we obtain 


ts (2) (2)--9" 


which is the law of quadratic reciprocity! The proof of Theorem [8-11]]is complete. 
© 


422 8. Congruence Class Arithmetic, Groups, and Fields 


8.8.9. Supplements of the law of quadratic reciprocity. Fundamental facts 
are often true for a multitude of reasons. In this subsection we give alternative 
proofs of some special cases of the law of quadratic reciprocity, called supplements. 
In particular, we consider the solvability of the quadratic congruence equations 
x? = —1 mod p and 2? = 2 mod p. 


Table 8.8.2. Examples of the affirmative case of the first supplement: So- 


lutions to «2 = —1 mod p for the first several primes that are congruent to 
1 mod 4. If z is a solution, then so is p— x = —z. 
Solutions to x? +1 for 

p | 2? =—1 mod p | the solution 
5 2.3 5, 10 
13 5, 8 26, 65 
17 4,13 17, 170 
29 12,17 145, 290 
37 6, 31 37, 962 
41 9, 32 82, 1025 
53 23, 30 530, 901 
61 11, 50 122, 2501 


Proposition 8.114 (First supplement to quadratic reciprocity). Let p be an odd 
prime. The quadratic congruence equation 


(8.259) x? =—1 mod p 

has a solution if and only if 

(8.260) p=1mod 4. 

In this case, by Lemma|8.107] there are exactly two solutions modulo p. 


In other words, 


—1 
(8.261) (=) =1 ifandonlyif p=1mod4. 
Pp 


Proof. By Euler’s criterion (in the form of Corollary 8.110) with a = —1, 


(8.262) (—1)*= mod p. 


— 

| 

Sa” 
Ill 


Since the possible values for both sides of this congruence equation are in the set 
{—1,1} and since p > 2, we thus have 


(8.263) (=) = (-1)"2. 


The lemma follows since the right-hand side is equal to 1 if p= 1 mod 4, whereas 
it is equal to —1 if p= 3 mod 4. 
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Table 8.8.3. Examples of the affirmative case of the second supplement: So- 


lutions to x? = 2 mod p for the first several primes that are congruent to 
+1 mod 8. 
Solutions to | «x? — 2 for 

p | x? =2mod p | the solution 

7 3, 4 7,14 

17 6, 11 34, 119 

23 5, 18 23, 322 

31 8 23 62, 527 

41 17, 24 287, 574 

AT 7, 40 47, 1598 


Proposition 8.115 (Second supplement to quadratic reciprocity). Let p be an 
odd prime. The quadratic congruence equation 


(8.264) x? = 2 mod p 
has a solution if and only if 
(8.265) p= +1 mod 8. 


p-l 


Exercise 8.50. Prove that p—(—1) > is divisible by 8 if and only if p = +1 mod 8. 


In other words, (2) = 1 if and only if p= +1 mod 8. An elegant way of saying 
this is the formula 


2 p2-1 

8.266 —|}=(-l)®. 

(8.266) (2) =n 

Exercise 8.51. Prove that if m,n are odd integers congruent modulo 8, then 
m?— n2— 

(8.267) (-1)"= =(-1)"=" 


To prove the proposition, we will use the following result. 
Lemma 8.116. Let p be an odd prime. The quadratic congruence equation 
(8.268) a? =2mod p 


has a solution if and only if the number of solutions to the quadratic congruence 
equation 


(8.269) x? + y* =2 mod p 
is divisible by 8. 
Proof. The key idea is an observation about the set of solutions to (8.269). Suppose 


that (x,y) is a solution to (8.269) of the “first type”, which we define by the 
conditions 


(8.270) x#0modp, y#Omodp, and «4 +y mod p. 


Then (x,y) is one of the 8 distinct solutions 


(8.271) (2,9), (@,—w), (—259)s (-2,-9), 2), ( —2), (ye); (2): 
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For example, for p = 7 we have no solutions of the type above. We only have 
the 4 solutions 


(8.272) (+1, +1) 
and the 4 solutions 
(8.273) (+3,0), (0,+3). 


(Note that —3 = 4 mod 7.) So we have a total of 8 solutions. 
By (8.270), a solution to (8.269) is not of the first type if and only if x = 
+y mod p or exactly one of x and y is congruent to 0 modulo p. 


So next, we say that a solution to (8.269) is of the “second type” if it is of the 
form 


(8.274) (x,x), (2,—2«), 
where 2? = 1 mod p. This implies 
(8.275) ee A: 


Since the prime p is greater than 2, there are 4 of this second type of solution. 


If a solution to (8.269) is not of the first two types, then it is of the “third 
type”, which we define to be of the form 


(8.276) (z,0), (—2,0), (0,2), (0,—2), 
where 
(8.277) az” =2 mod p; 


that is, x satisfies (8.268). 


So, altogether, we have a multiple of 8 numbers of solutions of the first type, 
we have 4 solutions of the second type, and we have 4 solutions of the third type 
if (8.264) has a solution, whereas there are 0 solutions of the third type if (8.264) 
does not have a solution. We have proved the lemma: 


Equation (8.268) has a solution if and only if the number of solu- 
tions to (8.269) is divisible by 8. 


We now give a proof of the second supplement to quadratic reciprocity. 


Proof of Proposition |8.115} By Lemmal8s.116} (8.268) has a solution if and only 
if the number of solutions to (8.269) is divisible by 8. 


So we consider the quadratic congruence equation (8.269) for (x,y). Make the 
change of variables 


(8.278) cxe=at+l, y=at+1 modp. 


Then equation (8.269) is equivalent to the equation for (a,t) € Z? = Z x Z that is 
modulo p, 


(8.279) 2 = (a +1)? + (at + 1)? = a2 (e? +1) + a(2t +2) +2. 


Assume that a 4 0. Then this equation is equivalent to the easier to solve (in 
hindsight) congruence equation 


(8.280) a(t? +1) = —2(¢+ 1) moda. 
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Suppose that ¢ is such that t? = —1_modp. Then (8.280) implies that t = 
—1 mod p, which is a contradiction since p > 2. Therefore 


(8.281) t? #-—1 mod p. 
Since p is a prime, for every integer t such that t? 4 —1 mod p, there exists a 
unique integer a modulo p such that (a,t) solves (8.280). 


Because, by (8.218), there are (+) +1 solutions to t = —1 mod p, we conclude 
that there are 


woe) (2) +i)-2-(@)-e- 


solutions to (8.280), where the last equality is by (8.263). 


So, from Lemma|8.116] there are solutions to (8.268) if and only if p— (-1)* 
is divisible by 8. Finally, we leave it to the reader to check that this condition is 
equivalent to p = +1 mod 8. 
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Hint for Exercise Let a € Z. Then [a]2 = [1]2 if and only if there exists an 
integer k such that 


(8.283) a—1=2k. 

Continue. 

Hint for Exercise The condition on n is equivalent to 
(8.284) Z = [ng U [1]3 U [2]s. 


Hint for Exercise m = 6. Explain why. 


Hint for Exercise[8.4] The inverse of the function f is the function g : Zm > Rm 
defined by g([r]m) = 71 for 0 <r < m. Show that f and g are inverses of each other. 


Hint for Exercise One way to prove this is to observe that |Z3| = 3 and to 
show that the three congruence classes on the right-hand side of (8.17) are distinct. 


This generalizes to the following: For any positive integer m and any integer b, 
(8.285) Zm = {[b]m; [b+ Im,---,[b+m—- 1m}. 
Hint for Exercise Suppose that a,b € Z are such that [a]m = [b]lm. By 
definition, this means that 
(8.286) a—b=qm_ for some q€ Z. 
Apply Theorem [4.16 


Hint for Exercise Suppose that [a]m[blm = [1]m. This is equivalent to 
ab = 1 mod m. Continue and then apply Theorem [4.20] 


Hint for Exercise Such pairs are in one-to-one correspondence with pairs of 
integers (a,b), where 0 < a,b < 24, satisfying 


(8.287) 24 divides ab. 
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Such pairs (a,b) are 


(2, 12), 

(3,8), (3, 16), 

4,6), (4,12), (4, 18), 

6,8), (6,12), (6,16), (6,20), 

8,9), (8,12), (8,15), (8,18), (8, 21), 
9, 16), 

10, 12), 

12,12), (12,14), (12,16), (12,18), (12,20), (12, 22), 
15, 16), 

16,18), (16,21), 

18, 20). 


Hint for Exercise [a] has an inverse if and only if there exists an integer 
b such that [alm[b]m = [1]m. Continue and use Theorem [4.20] 


a for Exercise [8.10] (1) [1~"]g = [1]s, [3~"]s = [83]s, [5~']s = [5]s, [7~1]s = 
7\s- 


Hint for Exercise [8.11} For example, for a = 2, [2~']7 = [4]7. Thus 2 = 
4-3 mod 7, which is equivalent to x = 5 mod 7. 


Hint for Exercise [8.12] If gcd(a,m) = ged(b, m) = 1, then ged(ab, m) = 1. 


Hint for Exercise The properties of reflexivity, symmetry, and transitivity 
are easy to check. 


Observe that |x| > 0 for all x € R”. So the equivalence classes of ~ are the 
sets 


(8.288) {xER” : |x| =r}, 


where r is a non-negative real number. When r > 0, this is the ((n—1)-dimensional) 
sphere of radius r. When r = 0, this is a single point, the origin. 


Hint for Exercise |[8.14} Reflexivity: Let A € GL(n,R). Taking P = I the 
identity matrix, we have 


(8.289) Awl “AT = A; 


Symmetry: Suppose that A,B € GL(n,R) are such that A ~ B. Then there 
exists an invertible real n x n matrix P such that B = P~'AP. This implies 


(8.290) Pap =P Pp APP =P Po APP) — 7A A, 
Since P = (P~1)~1, this says that 
(8.291) A=Q7'!BQ, where Q=P7!. 


Since Q is an invertible real n x n matrix, we conclude that B ~ A. 
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Transitivity: Suppose that A,B,C € GL(n,R) are such that A ~ B and 
Bw~C. Then there exists an invertible real n x n matrices P and Q such that 
B=P7-'AP and C=Q7!BQ. This implies that 
(8.292) C=O" 80 =0C=0 PAPO =(PO) Ao). 

Since PQ is an invertible real n x n matrix, we obtain that A ~ C. 

We conclude that ~ is an equivalence relation. 


Hint for Exercise Let ~ be an equivalence relation on a set X. Firstly, 
clearly 


(8.293) U (2) 


since x € [a] for alla € X. 

Secondly, suppose that x,y € X are such that [2] [y] 4 0. Then there exists 
z € X such that z € [a] and z € [y]; that is, x ~ z and z ~ y. By the transitivity 
of ~, we obtain that x ~ y. 

Now, if w € [a], then w ~ x. Since x ~ y, we have w ~ y, so that w € [y]. 
Therefore, [x] C [y]. Similarly, [y] C [a]. Therefore [xz] = [y]. This proves that 
E = {[a]: x € X} is a partition of X. 


Hint for Exercise We leave the proof of reflexivity and symmetry to you. 
Transitivity is true because 


zryandy~z qdk,@€Z, ex-y=kLl, y-z=L 
=> dk,@¢Z,«—z=(k4+0)L 
> L~z 


and because the following is true: if k,@ € Z, then k+¢ € Z. This proves that ~ 
is an equivalence relation. 


Let 
(8.294) R/LZ = R/~ 


denote the set of equivalence classes of ~. We can define a bijection between this 
set and the circle C of circumference L in the complex plane C. Let 


(8.295) c={eec: i=}. 
Define 

(8.296) b:R/LZ3C 

by 

(8.297) b([r]) =e* forx eR. 


One checks that the function 6 is well-defined and in fact a bijection. 
Yes, this is analogous to taking a string of length L and gluing its ends together. 
We have the quotient function 


(8.298) 7:R—>R/LZ 
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defined by 

(8.299) n(x) = [a]. 
Restricting this function to 

(8.300) am: (0, L] > R/LZ, 


we obtain a surjection. One checks that this function is an injection (and hence a 
bijection) except for the fact that 


(8.301) m(0) = a(L). 


So, we can consider R/LZ as being obtained from the interval [0,Z] by “gluing” 
(or “identifying” ) the two points 0 and L together. 


Hint for Exercise Relexivity is obvious, and symmetry is easy. For tran- 
sitivity, a fact that contributes to this is: The implication 


(2,0) ~ (—a,1) and (—2,1) ~ (x,0) = (2,0) ~ (2,0) 
is true. 


Hint for Exercise Figure [8.9.1] shows a sequence of pictures from the 


Klein bottle Wikipedia page 


Figure 8.9.1. Visualizing the Klein bottle as a quotient. Wikimedia Com- 
mons. Author: Inductiveload. Work released by author into the public do- 


main. 
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Hint for Exercise This verification is relatively routine. 
Hint for Exercise [8.20] The addition table for parity is given by Table 
The tables are basically the same. 


Table 8.9.1. Addition table for parity. 


ra) | even odd 
even | even odd 


odd | odd even 


Hint for Exercise This is similar to the previous exercise, where here we 
replace addition by multiplication. 


Hint for Exercise This verification is also relatively straightforward. 
Hint for Exercise The addition table for R3 is given by Table [8.9.2] 


Table 8.9.2. Addition table for Rg. 


mr oOf|® 


0 
0 
1 
2 


CON FIR 
RP onlwy 


Hint for Exercise The main point is that if a,b € R,», satisfy a+b >m, 
then 0<a+b—m<m. 


Hint for Exercise [8.25| For m = 2, (Rm, ©®) is essentially the same as Example 
8.39 


Hint for Exercise One easily checks that [0]m is the unit for +. Associa- 
tivity and commutativity of + for congruence classes follows from associativity and 
commutativity of + for integers, respectively. One also sees that the inverse of [a],, 
is [—a] since —m is the inverse of m for addition of integers and since [0], is the 
unit for +. We conclude that (Zm,+) is an abelian group. 


Hint for Exercise We see that [1] is the unit for -. However, [0], does 
not have an inverse. Therefore, (Z,+) is not a group. 


Hint for Exercise [8.28] The point is that (Zj,-) is defined so that multiplicative 
inverses exist. 

Hint for Exercise [8.29| The multiplication table for (D2,-) is given by Table 
8.9.3] 

Hint for Exercise We have m = ab, where 1 < a,b < m. In particular, 
neither [a], nor [b]m has a multiplicative inverse. 


Hint for Exercise The point is that two elements, where both elements are 
powers of the generator a and where the powers are any integers, must commute. 
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Table 8.9.3. Multiplication table for (Dg,-). 


OSs 2: 
Qa 7 8 FIR 
es oO F QIaQ 
Ss Fa tse 
me aeacrala 


Hint for Exercise Let eg and ey be the identity elements of G and H, 
respectively. Then one easily checks that (ecg, e#) is the identity element of G x H. 


Given an element (g,h), one also easily checks that (g~1,h~+) is its inverse. 
Finally, associativity is straightforward: 
((91, 1) © (g2,h2)) (93, hs) = (91 - 92, ‘ * ha) 0 (93, ha) 
= (("1 93; (hy * hz) * hs) 
Gd (92 +93), hi * (he * hg)) 
= (g1,h1) © ((92, h2) © (93, hs). 
Hint for Exercise [8.33] (1) Yes. (2) No. (3) No. 


Hint for Exercise This is intuitively easy, but perhaps clumsy to write a 
rigorous proof of. 


Hint for Exercise Let S$! denote the unit complex numbers. Firstly, since 
|1| = 1, we have 1 € S$, and since 1 is the unit of C*, it is the unit of S?. 


Secondly, suppose that z,w € $+. Then |z| = |w| = 1. We compute that 
(8.302) |zw| = |z||w] =1-1=1. 


So zw € $1, which shows that $1 is closed under complex multiplication. 


Thirdly, given z € $1, since z 4 0, its inverse z~! € C* exists. We check that 
g 

z|_l_ ll _, 

Iz?| lz zi? 


since |z| = 1. Thus z~+ € S'. This proves that ($1,-) is a subgroup of (C*,-). 


(8.303) jz 


Hint for Exercise [8.36] By the definition of xH, it is easy to see that fy is a 
surjection. We now prove that f, is an injection. Suppose that f,(hi) = fx(hg). 
Then xh; = xhg. Multiplying this by x—! yields hy = ho. 


Hint for Exercise The subgroups of D2 are 

(8.304) {1}; {1, a}, {1, b}, Ths Do. 

Hint for Exercise Again, [1]m is the unit for -. Let [a], € Z*,. By the 
definition of Z*,, there exists a unique element [b],, € Z*, such that 

(8.305) [O)m * [alm = [a]m - [lm = [1]m- 

Thus, every element has an inverse. Associativity and commutativity of - for congru- 


ence classes follows from associativity and commutativity of - for integers. There- 
fore, (Z*,,-) is an abelian group. 


m? 
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Hint for Exercise The multiplication table for Zj, is 


(Ziz,©) | Uh [52 [72 [Dre 
[1i2 | (U2 [5hi2 [72 [DD na 
(8.306) [5]ia [5ji2 ([Lja2— [A LJa2 [a2 
[T]12 [712 (11Ji2 [5lx2 [7] 12 
[112 | [72 ([Sju2[Ufx2 [It 
The multiplication table for Zj; is 
(Zi5,©)) [is [2lis (Alas [7a (8Jrs— (12Jas [13}15 [14] 
(11s Vis [2jis (Alas [75 (85 (1 Afas = [18)as  [14]as 
[215 25 [4jis (8Jis (145) [Ifis  [7las [A Ufas [13]a5 
(A]is Ais ([8)i5 (AJas [13)15 [2Jas (14]15 [Vas (Las 
(8.307) [15 This = [14]is [13)i5[4Jas [11Jas [2fn5 [Aus [8] 
[8]15 8jis = (Aas. (2Jas [12 Jas (Alas = [13]as  [14]as [TJs 
(W1Jis5 | (Udjis [las [14)as  [2fr5[13}i5  [UJas [Blas [Als 
(13}i5 | [13}is  [11jis [7s (Ais [14]is = [8Jas [14Jas [2s 
(14]15 (4]is  [13)a5  [11Jas (8a [Tas (Alas = (2Ja5 [1 Jas 


Hint for Exercise [8.40} Firstly, recall that the cardinality 

(8.308) IZ*,| = gm). 

Define the function f : R — Z*, by 

(8.309) f(a) = [a|m.- 

By definition, |R| = y(m). We claim that f is an injection, from which it follows 


that f is a bijection since the cardinalities of the domain and codomain are equal. 
We then are able to conclude that 


(8.310) {[a]m:a€ R} =Z*.. 


To finish the exercise, we now prove that f is an injection. Suppose that a,b Ee R 
are such that 


(8.311) f(a) = f (0). 


Then [a]m = [6]m. By definition, distinct elements of R are incongruent modulo 
m. So, if a and b are distinct, then they are incongruent modulo m. In this case, 
[alm 4 [b]m, which is a contradiction. We thus conclude that a = b. This proves 
that f is an injection. 


Hint for Exercise Theorem [8.59] 


432 8. Congruence Class Arithmetic, Groups, and Fields 


Hint for Exercise One isomorphism takes 1 to m, and the other isomor- 
phism takes 1 to —m. 


Hint for Exercise Recall from (8.75) that 


(8.312) Te {itl (Blah: 

One (easily) checks that the bijective function 

(8.313) f:Z.7 23 

defined by 

(8.314) f([O]2) = [1Ja, f(AJ2) = Bla 
is an isomorphism; that is, 

(8.315) Ff (lala + [b]2) = F([al2) - F([b]2). 
For example, 

(8.316) F (2 + [12) = F(O]2) = [4 = [Ja - [3]a. 
Hint for Exercise [8.44] Recall that 

(8.317) Za {eOw"" eC sk e Za) 
and f : Zm — Zm is defined by 

(8.318) f([Klm) = 8 27*/™, 


Firstly, we show that f is well-defined. The reason why this is an issue is 
because the input (on the left-side of the formula defining f) is [k]m, whereas the 
right-hand side a priori depends on k instead of [k]m. 


Suppose that [k]m = [€lm. We need to show that f([k]m) = f([Qm). This is 
equivalent to 
(8.319) el amkim — ei2xtim. that ig, ef 2™(h-O/m — 1, 


This is true if and only if (k — 0)/m is an integer; i.e., k — @ is divisible by m. This 
is true by our hypothesis that [A] = [¢]m. We conclude that f is well-defined. In 
fact, since we didn’t just prove an implication, but rather the biconditional 
(8.320) f([R]m) = Fm) = [k]m = [4lm, 
we actually proved that f is an injection. Since |Z,,| = |Z,,| =m, this implies that 
f is a bijection. 
Secondly, given [a]m, [b]m € Zm, we have 

F(la]m + [blm) = F([a + b]m) 
= e! 2m(a+b)/m 
= e! 2ra/m ot 2nrb/m 

= f (alm) + f([b|m)- 

Thus, f is a group homomorphism. 


Since f is also a bijection, we conclude that f is an isomorphism (it is easy to 
see that the inverse function is a group homomorphism). 
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Hint for Exercise Since I # {0}, there exists a non-zero integer a in I. 
Then |a| € I. 


Hint for Exercise Suppose that a € I —mZ. Apply the Division Theorem 
to divide a by m to get a remainder r. What can you say about r to obtain a 
contradiction? 


Hint for Exercise Since m is composite, there exist integers 1 < a,b <m 
such that ab =m. Explain why a ¢ mZ and b € mZ. 


Hint for Exercise |8.48} Suppose that integers a,b satisfy ab = pk for some 
integer k. Then p divides ab. Continue. 


Hint for Exercise Using the hint and the (extended) triangle inequality, 
we compute that 


[Zn] = [C21 + +++ + Zn) + (-21) +++++ (—2n-1)| 
Slate tanalt|— alte t| nl 
= Z1 ee Zn| + |2i| +--+ + |Zn-1- 
Therefore, 
(8.321) \zy +--+ + 2n| > len] — lea] —- ++ — |2n—il- 


Hint for Exercise [8.50} (<) (i) Suppose that p = 1 mod 8. Then (aye =i. 
so that 


(8.322) p-(-1)** =p-1 
is divisible by 8. 
(ii) Suppose that p = —1 mod 8. Then (ire = —1, so that 


Pp 


(8.323) p—(-1)*? =p+1 
is divisible by 8. 


(=) Suppose that p — (—1) 
just need to rule out that p= 4 


®>* is divisible by 8. Firstly, p must be odd. So we 
t3 mod 8. 
p-1 
2 


(i) Suppose that p = 3 mod 8. Then (—1)"2— 


pri 


(8.324) p—(-1)'? =p+l1 
is not divisible by 8, a contradiction. 
(ii) Suppose that p = —3 mod 8. Then (—1)°= = 1, so that 


= —1, so that 


(8.325) p-(-1)*> =p-1 
is not divisible by 8, a contradiction. 
Hint for Exercise Note that if & is an odd integer, then k = 2€+ 1 for 
some integer @, so that 
k?-1 = &(€+1) 
8.326 a 
(8.326) 5 4, 
which is an integer (since @ or £ + 1 is even). 
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We compute that 


man 
8 . 


=(-ye" 
Since m and n are congruent modulo 8, we have that >“ is an integer and that 
m+n is even. Therefore the ratio in the display above is equal to 1. 
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